1 



(19) 





(12) 



EuropSisches Patentamt 
European Patent Office 

Office europeen des brevets ( 1 1 ) 

EUROPEAN PATENT APPLICATION 

(51) Int. CI. 6 : H04N7/26 







EP 0 891 094 A2 



(43) Date of publication: 

13.01.1999 Bulletin 1999/02 

(21) Application number: 98202243.6 

(22) Date of filing: 28.07.1995 



(84) Designated Contracting States: 

AT BE CH DE ES FR GB IE IT LI NL PT SE 

(30) Priority: 29.07.1994 GB 9415413 

07.06.1995 GB 9511569 

(62) Document number(s) of the earlier application(s) in 
accordance with Art. 76 EPC: 
95305308.9 / 0 695 095 

(71) Applicant: 
Discovision Associates 
Irvine, CA 9261 4 (US) 

(72) Inventors: 

• Smith, Colin 
Bristol, BS1 8 4XF (GB) 



• Finch, Helen Rosemary 
Gloucestershire GL12 7ND (GB) 

• Claydon, Anthony Peter John 
Bath, Avon BA2 6BZ (GB) 

(74) Representative: 

Vuillermoz, Bruno et al 
Cabinet Laurent & Charras 
B.P. 32 

20, rue Louis Chirpaz 
691 31 Ecully Cedex (FR) 

Remarks: 

This application was f iled on 03 - 07 - 1 998 as a 
divisional application to the application mentioned 
under INID code 62. 



(54) Video decrompression 

(57) A parallel Huffman decoder, comprising : 

an input, accepting a mixed data stream comprising 
Huffman coded variable length codes, fixed length 
codes, and data tokens, wherein all complete Huff- 
man codes are received in a single cycle of opera- 

tion ; i . . 

a selector operative on said data stream, wherein in 

a first mode of operation variable length codes are 

provided to said address lines of a Huffman Code 

ROM, and in a second mode of operation data 



tokens and a signal representative of a length of 
fixed length codes is output by the decoder without 
reference to sad ROM ; 

a pair of input registers for receiving Huffman coded 
data, both of said registers directing input in parallel 
to said selector ; and 

a Huffman Code ROM for receiving input from said 
selector and another ROM table select input; said 
ROM providing a first decoded data output, said 
selector being responsive to a second output of 
said ROM. 
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Description 



This application claims priority under British Application Serial No. 9415413.5 filed July 29 1994 

ci„„ I, P T enl ' nVen,i0n fe,a,eS 9enera,ly t0 8 new and im P roved system for decoding a plurality of audio and video 
signals and more particularly, to a new and improved system for decoding a plurality of MPEG audfo aSvS^S? 

A senal pipel.ne processing system of the present invention comprises a single fcwSU bus ^eJlTSnn 
unique and special interactive interfacing tokens, in the form of control tokern and daTa tokens to^a pluraK 
adapfve decompress,on crcuits and the like positioned as a reconfigure pipeline processor P * 

United States Patent 5.1 1 1 .292 discloses an apparalus for encoding/decoding a HDTV signal for e a terr^triai 
transm,s SI on mc.udes a priority selection processor for parsing compressed vWeo cii^tS^J^^S 
a SiLnvT, Z ,r r Sm ' S ! i0 " A circuit responsive to high definition video m^S^S^Z- 

archical.y layered codewords CW representing compressed video data and associated codewords f dehSnt he uZ, 
of data represented by codewords C W. The priority selection processor, responsive to* S cwand T co^ 

the ?e U s^iv° SlTZTsZ °' d3,a ^ d6termineS tHe n "^ er * WtS " - = S22£SS£ 

me respective channels Thereafter the processor parses the codewords CW into high and low orioritv codPwnrri 

" ^ Pri ° rity COdeW ° rd S6quences COrres P° nd to compressed v^oTta of^alS 

greater and lesser importance to image reproduction respectively relatively 

inn, h^^ Sta,6S Pa,enl N ° 5 1 22 875 diSCl0SeS an a PP^atus for encodingAJecoding an HDTV signal The apparatus 
C °r reSS, ° n C " CUrt '**x>™™ 10 hi 9h definition video source sjna.s to? providing hSh^l^vTnS 
represen,,n 9 compressed video data and associated codewords T. defining the ^oSa reon? 
sented by the codewords CW A pnonty selection circuit responsive to the codewords CW and Tpa^TO the codewords 
CW into high and low pnonty codeword sequences wherein the high and low priority codeword stances c^S^S 
to compressed video data of relatively greater and lesser importance to image reproduction reSSve? A^transo^rt 
processor, response to the h.gh and .ow priority codeword sequences, forms high and low prior%tantport SocS 2 
2 iS£L°Z£T COd T ^ respective, y Each fra -P°* b'°ck includes a header, codeword ^ CW aTZ d^ 
SLSUL^: The / esp h ec, r K ,rans P 0fl b,ocks ara *PP"«* to a forward error check circuit for applying add^alerSr 

s^^£Er ,ow priority data are app,ied to a modem — 

s^^Hf' , , h K OS !^ ki " ed in 1he 3rt h3Ve recoanized 3 '° n 9 'e't need for a new and improved video decompression 
system obv,at.ng the deficiencies of the prior art systems. The present invention clearly fulfills this ™ eCOmpress,on 

Figure 1 illustrates data flow through a preferred embodiment in the present invention- 
Figure 2 shows an example of a 1 3 bit word used to address 8 bit data in a 64 X 32 RAM • 
Figure 3 is a functional block diagram of a Register file in the present invention- 
Figure 4 illustrates data flow in a register file as shown in Figure 3 

Roure t S I hTu "M'?" 9 addr6SS deC0din 9' in accordance with the present invention- 

Figure 6 is a Mock diagram of a Microcodable State Machine, in accordance with the present invention- 

XI ^T S 8 Wid,h ^ in 8CCOrdanCe With ,he Dresent ««J for aadreSgTrS having an 

address field, a substitution field and a substitution header; 9 

Figure 8 is a block diagram of one example of an Arithmetic Core in accordance with the oresent invention 
Sput dT ,eS b3SiS S,6PS in 3 m6,h0d - in aCC ° rdanCe With the ^ ™^ ££<ZSan IDCT 
Z^^^EE? ,h6 C0mbin6d - >• - a " «CT system, in 

^JI^VS^S! 0 * diagram * m intesrated circuit 0131 comprises the main sys,em components °' *• 

2^ JfLSl Fi9Ure 1 2b w 6n " ,09ether 3re 3 b ' OCk diaSram ° f 3 P'Wessing circuit corresponding to one of 
the main system component; for ease of explanation, these figures are referred collectively as Figure 12 

cTtio. SS^lS?."!" 1 f 'IT 9 di39ramS WhiCh i,IUSlrate ,he rala «^ beteen timing and 
coniroi signals in the IDCT system of a prefened embodiment" 

Figure 14a and Figure 14b taken together are a block diagram of a common processing circuit in the IDCT svs 
tem.-for ease of explanation, these figures are referred to collectively as Figure 14- * 
Figure 15a. Figure 15b. Figure 15c and Figure 15d taken together are a block diagram of a post-processina circuit 
wh^h conresponds to another main component of the system and are refened coSectfvely asl^lT 
Figure 16 is a block diagram, in accoidance with the present invention illustrating an IDCT havino a twin data 
stream, a transpose RAM and an improved buffer; 9 ala 

Figure 17 is a block diagram showing in further detail the 1-D IDCT system shown in Figure 16- 
Figure 18 is a block diagram showing greater detail of the transform system as shown in Figure 17 
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Figure 19 is a block diagram showing in greater detail the input buffer shown in Figure 18; 

Figure 20 is a simplified block diagram of a pre-processing circuit "PREC". in accordance with the present .nven- 

Figure 21 is a block diagram illustrating a common processing circuit "CBLK" found in the IDCT; 
Figure 22 is a block diagram of a post-processing circuit "POSTC"; 

Figure 23 is another illustration of the post-processing circuit shown in Figure 22; inwontinn . 

Figure 24 is a block diagram depicting a round and saturate block, in accordance w.th the present invention. 

Figure 25 is a block diagram of an output buffer in the present invention; 

Figure 26 is a block diagram of a control shift register, in accordance with the present invention; 

Figure 27 is a block diagram of a control shift register decode in the present invention; 

Figure 28 depicts a control shift register and an input control butter; 

Figure 29 illustrates a control circuit for a T2 data stream; 

Figure 30 shows data in a counter for a T1 data stream; 

Figure 31 depicts data in a counter for a T2 data stream in the present invention; 

Figure 32 is a timing diagram showing the initialization of the IDCT and associated circuitry 

Figure 33 is a timing diagram showing the interleaving of T1 and T2 data; 

Figure 34 is a timing diagram illustrating slippage and recovery of T2 data; 

Figure 35 is a timing diagram depicting a flushing operation of the IDCT and associated circuitry in the present 

invention; . 
Figure 36 illustrates start-up of the system, in accordance with the present invention; 
Figure 37 depicts slippage and recovery in the early stages of interleaving T1 and T2 data; 
Figure 38 illustrates another preferred embodiment of the IDCT system shown in Figures 1 6 through 37; 
Figure 39 shows MPEG information streams being demultiplexed, in accordance with the present invention, into 
elementary streams containing data and timestamp information; ^ rnni 
Figure 40 depicts a first embodiment of an elementary stream timestamp error determ.nat.on and time synchroni- 
zation system, in accordance with the present invention; 

Figure 41 illustrates a second embodiment of an elementary stream timestamp error determination and time syn- 
chronization system, in accordance with the present invention; 

Figure 42 shows a third embodiment of an elementary stream timestamp error determination and time synchroni- 
2ation svstem in accordance with the present invention; 

Figure 43 depicts a first embodiment of a video timestamp error determination and time synchronization system, .n 

arnordance with the present invention; , . _ » 

fS 44 iLstrates a second embodiment of a video timestamp error determination and time synchronization sys- 

iem in accordance with the present invention; . 

Figure 45 shows the second embodiment of a video timestamp error determination and time synchronization sys- 
tem as shown in Figure 44 and operating at 30Hz; 

Figure 46 shows timestamp information flow through the system of the present invention; 

Figure 47 is a block diagram illustrating synchronization time information being processed by a microprogrammable 
state machine; 

Figure 48 is a block diagram illustrating a f irst preferred embodiment of the present invention; 
Figure 49 is another block diagram illustrating the first preferred embodiment of the present invention; 
Fioure 50 depicts a second preferred embodiment of the present invention; 

Figure 51 illustrates a detailed method of addressing used by the second preferred embodiment, in accordance 

with the present invention; . - . , *u #u« 

Figure 52 is a block diagram showing an apparatus for decoding Huffman VLCs. in accordance with the present 

Rgu'reSS is a schematic diagram showing the overall structure of the parallel huffman decoder of the present inven- 

Sure 54 is a schematic diagram illustrating a ROM adapted for decoding parallel huffman codes; 

Figure 55 illustrates a first embodiment of a ROM adapted for decoding parallel huffman codes; 

Figure 56 illustrates a second embodiment of a ROM adapted for decoding parallel huffman codes; 

Figure 57 depicts a third embodiment of a ROM adapted for decoding parallel huffman codes; 

Figure 58 is a block diagram illustrating the primary system component of one embodiment of the present .nvention; 

Figure 59 is a block diagram depicting the start code detector of the present invention; 

Figure 60 is a block diagram showing the parser of the present invention ; 

Figure 61 is a block diagram depicting the primary components of the spatial processmg circuitry of the present 

invention; . . „ , . ,. 

Figure 62 is a block diagram illustrating the display circuitry, in accordance with the present invention; 
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Rgure 63 fllustra.es one embodiment ot timestamp management, in accordance with the present invention- 

fZt £ ST? TT & en1b<XJimen, ° f ,imeS,am P management in the present invention 

Roue £ I a 22; f 9 ™ dGpiCtin9 ,he hardware ««P™«l. of the system of thTpSsent invention- 

Rgure 66 . a Woe* diag ram proving an overview of the system components of the JtZZZXESL present 

Sure £ flt a sX! Arithmetic core of the present invention; 
ngure wj illustrates the ALU of the present invention 

Rgure 69 depicts a register file, in accoidance with the present invention- 

fZI 7? tTT J"' ""J" 9 10 indePendent "» in P^t invention- 

Rgure 71 illustrates frame-based prediction wherein vectorfl] = o and vectorfOl - 0 
Rgure 72 depicts frame-based prediction wherein vectorfl] = o mS^Sm^v 
Rgure 73 shows frame-based prediction wherein vectorf ] = i and vSoi - 0 ' 
Rgure 74 i.lustra, e s frame-based prediction wherein ve^i] . i a£2&* , • 

Rgure 84 shows still another prediction mode- 

rZl S !h S,ra,6S an ° ther PrediCt, '° n mode " in accordance with the present invention- 
Rgure 86 shows another prediction mode of the present invention; ' 

ol £SSST m illUStratin9 ^ ° r9aniZati ° n ° f the Vari ° US SyS,em co ^- te - ^'av system 
Rgure 88 depicts a 4:3 filtering operation; 
Rgure 89 depicts a 3:2 filtering operation- 
Figure 90 illustrates a 2:1 filtering operation of the present invention- 
Rgure 91 shows a three tap filter used in the present invention ; 
Rgure 92 illustrates the repetition of erroneous pels- 
Figure 93 depicts the ffled.id signal of the present invention 

Rg'ure It £^£5!^ h aCC0 ' dance *• P^en, invention; 

Rg'ure £ £££ 2 ^SJSSSS^St ^ * »• P'-ent invention; 

Figure 97 shows a ^S^^ »— " invention; 

Figure 98 .llustrates border generation in the present invention- * 
Rgure 99 depicts picture cropping, in accordance with the present invention- 
Figure on ,s a block diagram illustrating the present invention as a chfp 
Sure 02 EES ,h r ySC,0Ck ^ ui —ts of the present invention 

Sure S ^JS^SS^^ - ™ «» -e present invasion; 

Rgure 104 shows a FLUSH token of the present invention- 
Rgure 1 05 illustrates the timing of the coded data interface- 

Rgure 08 .llustrates output timing in 8 bit mode in the present invention- 

Rour'e ?o 2S3L •H timin ! L° f thS vW, ° 0UlpUt interface in ***** ^ntion; 
Rgure 10 depicts video output mode signals, in accordance with the present invention- 
Rgure 1 1 1 shows horizontal timing in the present invention- invention. 
Figure 112 shows the vertical timing for a 525 line system- ' 
Rgure 113 depicts the vertical timing for a 625 line system- 

Rou re a rt^cf^ mKb0n confi 9 uratton ! " *e present invention; 

Rgure 1 18 d^cts a two SDRAM connection configuration, in accordance with the present invention; 
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Figure 1 1 9 illustrates a three SDRAM connection configuration 

Fiaure 120 is a flow chart depicting the flag_picture_end operation, in accordance with the present invention; 
Figure 12? Is 1 f low chart showing the start_code_search operation, in accordance with the present invention; 
Figure 122 shows timestamp modification, in accordance with the present invention 

, ra 1 v\ iih Urates the read timing for the microprocessor interlace; and 
F igu e 24 S^SLg for the microprocessor interlace. In the ensuing description of the practice of the 
Sor UN ^following terms are frequently used and are generally defined by the following glossary: 

GLOSSARY 

° mncK An 8-row bv 8-column matrix of pels, or 64 DCT coefficients (source, quantized or dequantized). 

5^S^3wSSSn)= A matrix, block or single pel representing one of the two color difference signals 
r^tSTrSfma^colors in the manner defined in the bit stream. The symbo.s used for the cofor difference s.gna.s 

t« rnnF^RF^RESENTATlON: A data element as represented in its encoded form. 

CoUS S.DEO fS ^TREA„ : a coded representation of a series of one or more pictures as defined in th.s specif ,ca- 

CODED ORDER: The order in which the pictures are transmitted and decoded. This order is not necessarily the same 

x> cyMPON^fmatrix. block or single pe. from one of the three matrices (.uminance and two chrominance) that make 

up a picture. . . . 

COMPRESSION: Reduction in the number of bits used to represent an item of data. 

™ r g T^™X~*"^ « that reads an inp * — bite,ream and produces 

25 ^y^S^^St^ the decoded pictures are displayed. Typical., this is the same order in which - 

XSKKS^^ * •* ^ ication - *- reads a stream 01 input pictures or audio 

^SSSS^SSS^Tr^ bloc'k or single pel representing a monochrome representation of the s,gna. 
enc Trelated to the primary colors in the manner defined in the bit stream. The symbol used for luminance is Y 
M^CRO BLOC let Thef our EU>y 8 blocks of luminance data and the two (for 4:2:0 chroma format) four (for » 
fo^natt or eight (for 4 4:4 chroma format) corresponding 8 by 8 blocks of chrominance data coming from a 16 by 1 6 sec- 

35 ttoTo f t^e!u^na?ce c 0 mp Onent of the picture. Macroblock is sometimes used to refer to the pel data i and somet.mes . 

3 5£ StZSZSZ of the pel va'lues and other data elements defined in the ***** 
defined in this part of this specification. To one of ordinary skill in the art, the usage is clear from the context 
MOTION COMPENSATION: The use of motion vectors to improve the efficiency of the pred.ct.on of pel values. The 
pS us* rS vectors to provide offsets into the past and/or future reference pictures contammg previously 

40 decoded pel values that are used to form the prediction error signal. n(ft .^, rnm th0 rrwriinatP 

MOTION VECTOR: A two-dimensional vector used for motion compensation that prov.des an offset from the coordinate 
nncitinn in the current oicture to the coordinates in a reference picture. 

SSSSir^^ of a macrcblock or picture that uses information both from itself and from macroblocks 

and pictures occurring at other times. 

45 RCtJre" SotrcTcoded or reconstructed image data. A source or reconstructed picture consists of three rectangular 
Sees of S^Tnumbers representing the luminance and two chrominance signals. For progressive video, a picture ,s 
SeS to a frame, while for interlaced video, a picture can refer to a frame, or the top field or the bottom f ,e.d of the 

so l7^^Z°:^tT^o f to provtfe an estimate of the pel va.ue or data element currently being decoded. 
RECONFIGURABLE PROCESS STAGE (RPS): A stage, which in response to a recognized token, reconfigures .tself 
to perform various operations. 

SLICE: A series of macroblocks. . 

TOKEN: A universal adaptation unit in the form of an interactive interfacing messenger package for control and/or data 

55 STARTCODES [SYSTEM AND VIDEOl: 32-bit codes embedded in a coded bitstream that are unique. They are used 
for several purposes including identifying some of the structures in the coding syntax. «,^, 1Dn , 
VARIABLE LENGTH CODING; VLC: A reversible procedure for coding that assigns shorter code-words to frequent 
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events and longer code-words to less frequent events 
VIDEO SEQUENCE: A series of one or more pictures. 

Brief ly, and in general terms, the present invention provides a new and improved method and apparatus oarticularh, 
adapted for use .n a two-wire pipeline system having various control and DATA tokens The i^E^ rtSS? 

' SJZZSZ^ 1* Det T- 3 v ; deo Parser incorporatin9 a Huffman Decod - 

State Mach.ne (MSM , an Inverse D.screte Cosine Transform (IDCT). a synchronous DRAM controller wtth^n a^ori 

« a ^o M °/ e im P 0r,anUy - various embodiments of the invention may include an MPEG video decompression method and 
o apparatus ut.hz.ng a plurality of stages interconnected by a two-wire interface arran ae d^Tn!n! ?n D T 

machine. Control tokens and DATA Tokens pass over the single £Z See f^ng bot hTn ol SSSZ 

Bv wa / of example, and not necessarily by way of limitation, the present invention may include amono its various 

ESTr" aPPa . ra,US SynChr ° ni2in9 fime havi "9- a S ^P ^ determining presentation toe T55££!£ 
or ,n,t.ahz,ng system time in a first circuit, a first time counter in communication with the dock ife ^ce fo^epta 

Z STrHV c,rc f a : d I second time inilia,ized Dy dock re,erence ir > • ■^^^JSSSl 

w^h the clock reference for keep.ng system time in the system decoder and a second time counter initialized bv the 

usina^'dXl^ln 6n v 'r" 6 inVenti ° n indUdeS a " apparatUS for ^hronizing a first circuit and a second circuit 
Tnr Zf* 5? e,e / en f <0r ,n ' t,al,z,n 9 system time in the first circuit, a first circuit having a time counter in comrnuiSi 
tan wrth the clock reference for keeping system time, a first elementary stream time counter mTe f rst ci^Tttor ^ 
v.d.ng elementary stream time. The first circuit is adapted to receive a time stem anri r,L c,r , cu,t for p , r °- 

curt ,s adapted to rece.ve synchronization time from the first circuit and has a second ^nL^^^^Zr 
.n synchronization with the first elementary stream time counter for providing a local copy ^^SSSS^Sl 
and for deferring a t.ming error between the system time and the time stamp by compart I^S^S^Tr^ 
he .oca. copy of elementary stream time. In this way. the clock reference signTdoes^S KISSES to 
the second circuit in order to determine the timing error. passed directly to 

In another embodiment of the invention, an apparatus for synchronizing a first circuit and a second circuit has a 

Tc^TZlSJTlT 9 sy5, T " 7 in the ,irsl circyt - ^ ,irst circuit has a lime l^SSSS^ 

the clock reference for keeping system bme. and a first video time counter for providing video decodina time The first 

SSSm "t^ and subtractin9 sys,em time - ™ e second £5E2K£E 

synchron.zat.on time from the f ,rst crcurt and has a second video time counter in synchronization with the first video 

SiT *7 P ^ Win9 K ' OCal C0Py ° f Vk,e ° d6COdin9 time and for determining alin erro betwle XS^SS 
and the v.deo t>me stamp by comparing synchronization time to the local copy of video decodina ta ZlZ Z 
clock reference signa. does not have to be passed directly to the second 1 ^SS^JSJ?^ 

in„ a ^r? ePt '^"l 00 a,S ° inC,Ud6S 3 meth0d for provk,in 9 i"fonnation by providing aTdeo data sZm hav- 
inga time stamp earned m packet header wherein the time stamp refers to the first picture i ?to pSJS STta . ft* 
next step a register is provided having a flag used to indicate valid time stamp information whi^Ttake^ ^ from , hp 
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^ • a u mflnfaru otrpam decoder is restricted to 1 6 bits. Furthermore, there is an appara- 
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StfwTpa^ellS tKTJ?«or is indeed and indicates whan the compared value is outs.de acceptable 

Pa Tn 'alternative embodiment of the invention includes an apparatus for using a system decoder and a ^decoder 

tk ^IZ T^nZr * adanted to accept MPEG system streams and demultiptexmg v.deo data and the v.deo time 

SmSm SI ?he syttem decSer has a first time counter representative of system time. The video decoder 
stamp from the stream. The sy«ema jn s hronization Wlth the first time 

accepts the v.deo d ^^ X ^^^ P ^ for accepting the video data at a substantially constant rate and 
C TT^So d^ *£o time stamp. The video decoder while decoding a pic 

outputtrng the video data at a varymg ^ratea P 9 the decoded picture with the second time counter to 

ture «rom th^o^ SLSSffrhlS^ a method for determining a timing error between a first circuit and 
a S !XSS^SS^fSSS^ a systemdime (SY). a time stamp (TS). and an elementary ^eam time 
?ptT ottairZ synchronization time (X) by using the elementary stream time (ET). the time stamp (TS). and the system 
S'(SY Mn accSnce "th the equation X-ET + TS-SY . providing synchronization time (X) to the second M and 
gTneS a etemeSary stream time (ET2) and obtaining a timing error by using 

anSTn accordance with the equation ET2-X. Hence, the first circurt can be t.me synchron.zed w.th the second c.rcu.t 

—SSr 1 SSSS- between a first circuit and a second circuit has 

Mnoinei meu.^ " B1W1 /XQX anH an : nitia i t ; me /m obta n ng a synchronization time (X) by using 

F^JI^'S'S ^^^JSZS^^ X^TS-l" providing synchronization time 
X rthe secorS c IcuH and . Synchronized elementary stream time (ET) and obtaining a timing error by 
!2ng fynchr^ed Se (" and in accordance with the equation ET-X. In this way. the first circuit can be « synchro- 

time (SY). in a « ord ^^ to the lideo decoding time (VT) in 

parser microprogrammable state Machine (MSM). and can sustain a high t^oughput 

m one embodiment of the invention a code lookup technique is employed to decode Huffman codes to acmeve per 
f ormance !Z££Sl and to handle the second MPEG-2 transform coefficient table which is irregular ornonwn. 
oTin n*uS See of the invention also facilitates decoding certain more complex components tantaM?n 
a simte Se whhout the assistance of an external controller. Examples of such complex components are Es»pe- 

l^JSSSSX^y^ — Moti °" Vector de,tas - an of which are present in the as ° 0mb 

VL Td 3 eSdTa 0 v!3 S in P ut is first loaded into the two input data registers handling most significant and least signHicant 
data A seSor is used to align the beginning of the next VLC with the ROM input. Hen 

tor outouts the too 28 bits of its 59-bit input and the top 16 bits of these are passed to a Huffman Code ROM. For sub 
seauenTvlS the se.eJto? effectively shifts the input according to the total count of bits decoded thus far. the count . 
m^nt^bv eddino the size of each VLC as it is decoded, to a running total. The various word widths are a result of 
r^lte ^e decoded, which is the 28-bit MPEG-1 Escape Coded Coefffcient. and the max- 

imum VLC size which is 16 bits (DCT coefficient tables). yDCP 
The ?aWe select" input is used to select between the various different Huffman code tables reared by MPEG. 
t£ e ROM haT ^ddresseJ which are controlled with a selector/shifter. The ROM performs a VLC table mdex calcu- 

lation followed bv the index-to-data operation that yields decoded data. 

is a contertaddressabfo memory (CAM) operation wrth "don't care" matching -mplemented 
to tatf e to J Huffman codes which form the presented data. Since the index generation is performed in a look-up man- 
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^rRO^^Tir"^ thSre iS n ° reS,riCti0n ,0 hand "' n 9 tab '<* which are canonica. 

ROM I also has entries to Entity i„e 9 a. VLC %£££^»£^£^ MPEG code tables, the 

In another embodiment ot the invention a nmrprt,™ . 'Z, Z. 5 ° me ? oae taWes - 
number of bits to be used for addre^ng ^ialle S da.L 2? h ■ ' WOrd ^ ,ixed ^ havin 9 a «** 

is also a procedure for acWressinrml^t^^i L,^^ 9 . 3 ^ "f™ 9 ,ie,d and address There 
addressing data and having a subsWXd^j ■SSTrTT* i™" 9 ' ^ nUmber 0f bits ' t0 be for 
ing a state machine and an arithmetic^ ^ ^ 3 " 3PP3ratus *» dressing memory, includ- 

' f SK ^ for ™ g tSSZSXSZF * J"? ^ ^ 3 

field and an address field, providing the 3 infno STwIfh , T ,he " Xed W * dth WOrd wth a ^ defining 
defining the address field w,°h a pJa it oTbte SffiiJlSI ff, °" e bit t0 " n " 35 the ,ermina,ion 
in inverse relation to the size of "he Sable wlS data Z^naZl T "7^ °' ^ h the address field 

relation to the size of the variable width date a^d mat 2l2? r ^ T*" ° f b,tS ,he Wid,h de,irtn 9 ,ieW in *«*t 
varying the width of the width defSS ^a^SfeS ^ for 3ddres * n 9 -riab.e width data whi.e 

having at least one bit to serve as a teS m^kprTo^, ^ ! ' e3St 006 substitulion »he substitution field 
substiiution field to indicate substh tf2 S from T^^^ "* 3 " d Substitution f ie,d ' «he 

addressing variable width data whi.e inve* d^a^ a fixed width word for 

field. y ^ 9 Ine Wldth of 1,18 address field and the width of the substitution 

by pIovtdTa^y E£X i pre^n^dS^ 6 3 ™™> ma ' be *«»«•*•■ 

to be accessed to a least signUnTotttsS^^S^ 1 .t COmP ° Sed ° f P 3 * 3 ' words - ro,atin 9 ,he Partial word 
will be recognized as a partfal wo* nlS^S^S^ 0 ? IT™" 9 P3ft " ' he WOrd so that the acc ^ word 
is restored to its original posJn 9 P3rt °' the WOrd ' and ro,atin 9 *• word until the partial word 

fixed ^Sng 7^!^^^^ —V therein a word is proved with 

and address field In addition a o°<xedu7e for f Sr? 3ddress,ng var,ab,e width data, and having a width defining field 
bits, to be used for adSS dateS atSSSon 3 ^ h3vin 9 a fixed "umber «3 

The invention may also include a « Jr f ' eid and an address fie,d - "«y be used, 

termined fixed burst Z£i l o *e RAM Te RA^Sn^an 22." fT? M °' ^ ^ * 1655 ,h3n ,he P rede " 
ing from and writing to the RAM. the Z^^S^SSSST ^ ^ ^ read " 
ordenng N words to be read from or written to the RAM ■ 

each o- one ^ e „ ^ZZZE^^gg - »» "»* -«*«-«». 

(a) assigning each cell a particular one of the two bani« en *h=.f on ^ . 

are read from and written to one particular oaal Trtto*™Z ? K ^ at ! WOfdS associa,ed with that particular cell 
such that each cel. is assorted wS, a dSS M C h ? assi 9 nme "« of banks to cells being done 
or in the same column ■ ^ any border,n 9 cell which is also either in the same row 

(c) identifying which cells frTthe^im^Li ^Tl^ Ce " S the ♦wo dimensional grid pattern. 

(d) reading, from the £ baS ? JSSffSSS ^wS ^STS*; 3SSOda,ed ^ lhe Unali 9 ned 

f ied as containing data words alc^d' w^hJu^gne^S, ** °' *° °* "» 9rW " ide ^ 

^eSrr^^ «* — of .e ce„s in the grid pa«ern 

(f) repeating steps (d) and (e) until all the data words associated with lhe unaligned cel. have been read 
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the RAM interlace via a two-wire interface. 

The invention may also include a method to control the buttering of encoded video data organized as frames or 
fieldr^ Sod^nvolves determining the picture number of each incoming decoded frame, determmmg he 
ettt^preTentSion number at any time and marking any buffer as ready when its picture number « on or after the 

^TcSSTose concerned with the design, development, and utilization of systems for decoding video data 
h«Ji^rSirtzed the need for enhanced performance as accomplished by the various features of the present 
■Z^^^ZZtaLges of the present invention will become apparent from the following more detailed 
description taken in conjunction with the accompanying drawings. 

DETAILED DESCRIPTIONS 

The forthcoming "Detailed Description of the Invention" contains the following Sections: 

1) Detailed Description of the Invention for Memory Addressing 
Variable Length Fields Within a Fixed Width Word 
Using Fixed Width Word with Variable Length Fields to Perform 
Address Substitution 

Addressing Variable Width Data with a Fixed Width Word 
Microcodable State Machine Structure 

2) Deta^De^rip«on of the Invention for Transforming Data using a Common Processing Block 
Theoretical Background of the invention 

3) Detailed Description of Invention for Time Synchronization 

4) Detailed Description of the Invention for Asynchronous Swing Buffering 

5) Detailed Description of the Invention for Storing Video Information 

6) Detailed Description of the Invention for a Parallel Huffman Decoder 

The Huffman Code ROM 
Maximizing Throughput 
30 FLCs and Tokens 

Implementation 

7) MORE DETAILED DESCRIPTION 

■ 

DETAILED DESCRIPTION OF THE INVENTION 

35 As ah introduction to the illustrative embodiment(s) of the most general features of the invention and referring more par-. 
S vt^lT e drawings, the data flow through the preferred embodiment 200 of the mvention ,s showrv The 
embXem of the present invention is preferably implemented using a two-wire pipeline system having vanous control 
ar^DATXtokens The major elements of the system are a Start Code Detector 201 . a Video Parser 202 incorporating 

<o I HuSmTn DeTier 203 and a ^reprogrammable State Machine (MSM) 204. an Inverse Discrete Cosine Transform 
MDCtTJos ^^synchronous DRAM control.er 206 with an associated address generation unit 207. appropriate pred.c 
£n*£ri 2oTand display circuitry 209 which includes upsampling 2!0 and 21 1 and video timing 2 . 

^ipncation relates to similar subject matter disclosed in British Patent Application number 9405914.4 entrtled 
-Vd2 DSCSsion" filed on March 24. 1994. by Discovision Associates, and the latter application ,s specially 

45 "*ZZ2KZ£ ESSE aspects. features and subsystem areas of the present — - be 
referred to in greater detail below. In the drawings, like reference numerals denote like or corresponding parts through- 
out the various drawings and figures. 

so Detailed Description of the Invention for Memory Addressing 

In accordance with the present invention, a method and apparatus for addressing memory is described herein. In par- 
IS Se present invention provides for deferring variable width bit fields with fixed width words. More par^ulany the 
present invention provides a method of addressing variable width data wrth a f ixed w,dth word. In vanous torn* of he 
55 embodiment variable bit field is used to specify bits to be substituted into the word or to specfy an unused portion of 
thVwoidtn addressing variable width data with a fixed width word. In addition, the system of the present invention 
includes a microcodable state machine having an arithmetic core. 

The microcodable state machine is intended to be used for solving design problems where there is a need for ver- 
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"cSSsis^tr^ address 9eneration - s,ream ^ - 

variable length addresses to access i v^miS «' the * ddress,n 9 must with two different features: (1) 

tion. a RAM having a64x 32 brt ZSto KSl? ^ P ^ Substitulion - ,n the P^sent inven- 

8 bit. 512 x 4 bit. 1 9 024 x 2 b« o! SS " bTfoTrn^s ^ Part,a ' ^ ^ 64 X 32 »* 128 * 16 bit - 256 x 

Variable Length Fields Within a Fixprt wiwth 

iro vwttN. portions of « JSh™ „ SS^T ? , ParB °' *" ""*" ^ "^Wl "»»»<> *of dam. 

lns rsarsr ~Tjriis^? ,h * — 
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Table 1 



■ — - 

Bit number (hex) 


/ 
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4 


3 


2 


1 


0 


Fixed word 


w 
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w 
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F 
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w 


w 


w 


w 


w 


w 


F 


F 
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Table 2 



Bit number (hex) 


7 


6 


5 


4 


3 


2 


1 


0 


Held Define 


Fixed word 


w 


w 


w 


X 


X 


X 


X 


X 


1 


0 


1 


w 


w 


w 


w 


w 


w 


X 


X 


0 


1 


0 



series of continuation markers followed bv a terrnVnat?™,^ T . ?J »P«*ecl. from one end of the field, as a 
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Table 3 

Bit number (hex) 
Fixed word Continuation marker = 1 ; Termination marker = 0. 



7 


6 
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I 4 


3 


2 


w 


w 


w 


~0~ 


1 


~ 


w 


w 


w 


w 


w 


w 



1 

T 



Therefore, the advantages of the encoding method, in accordance with the present invention, are: 
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^ A reduction in the number of bits needed in the encoding. ^ 
2 ' A simplification in the decoding process is required since the need for a "x to 1 of decode of the "lield define 
shlnTn TaWe 1-2 that would normally be required is inherent in the encoding which .s already m the form of 1 of 

2 * a nd 

3. "The encoding is in a more intuitive form allowing the field defined to be more easily identified. 

Furthermore the use of this encoding method of the present invention can also be used such that the termination 
7* Tihe continuation marker are inverted to provide that the encoding of Table 3 resembles that of Table 4. 
Hence, the use of "1 " or "0" is used interchangeably throughout this application. 

Table 4 



Bit number (hex) 


7 

* 


6 


5 


4 


3 


2 


1 


0 




Fixed word Continuation marker = 1 ; Termination marker = 0. 


w 


w 


w 


1 


0 


0 




0 




w 


w 


w 


w 


w 


w 


1 


0 


0 



As oreviously identif «d the held encoded must be justified to either end of the word. Table 5 illustrates most s.gnrf.cant 

i e these are encoded in a similar way to least significant bit justif ied f ields except that the Md reaches 
iSXSUl "gnLnt bit (hereinafter MSB) towards the feast significant bit (hereinafter "LSB") up to and .nclud.ng 
the first termination marker The encoding of the fields shown in Table 5 are shown in Table 6. 
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Table 5 



Bit number (hex) 
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Fixed word 
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Table 6 



Bit number (hex) 


7 


6 


5 


4 


3 


2 


1 


0 




Fixed word Continuation marker = 1 ; Termination marker = 0. 


1 


~7~ 
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1 
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w 
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w 


w 


w 


w 


w 


w 



Moreover fields may be encoded from the least significant and most significant ends of the word ^muftaneouslyFor 
example, th" T two fields shown in Table 7 may be encoded as in Table 8. with the addition of just one b.t for each held as 
described previously. 

Table 7 



Bit number (hex) 
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2 
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Fixed word 
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w 


w 


F 


F 


F 


F 
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Table 8 



Bit number (hex) 



Fixed word Continuation marker = 1 \ Termination marker = 0. 




10 Us ' nq a Fixed Width WorW with VariahlP I *nath RplriQ tn P. 



Address 



addresses of a memory to m6th0d ° f the present inventlon «n be applied to the 

iable length field is us£ K 

Obaaaaaaaaaaaa encoded to have SS^SSS^'^ ^^L**** a 12 » ad *«* 

ObaaaaaaaOl 1 1 1 1 and Produce th"^ th * 12 b " value Obcccccccccccc would be 

address. yoaaaaaaaccccc. Table 9 shows the encoding for substitution into a 12 bit 
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No. Bits substituted 


B 


I A 




I a 


| a 


L ~ "~ 


I a 


I a 


2 


I a 


I a 


L ® 


a 


a i 


I 


! a 


I a j 


5 


a 


a | 


L 6 




a I 




a 


a j 


L ® I 


a I 


a 


9 


a 


a 


I ™ I 


a I 


a I 




a ] 


0 


li T 




1 



Address 



Table 9 
substitution 



a 



a 



0 



a 



0 



a 



1 1 



1 1 
1 1 
1 1 



Addressing Variable Width n^ a with a RvpH WiH t h 



encoding of the present invention can be used to 1S««E^ ' be Sh0wn how the variab,e ,ield 

To access a 64 * 32 bit t^^^Z'^tl TZZ'lt? *Z? ^ *• me ™'* 

the implementation erf this embodiment is a M x 32^bh mlmL ^ ? k reqU,r6S d,f,erent ,en9ths 01 address - i e - 
256 x 8 bits. 512 x 4 bits 1024 x 2 Ms or 20^ * ?£? T ^ be 3CCeSSed 35 64 * 32 bits, 128 x 16 bits, 

bit .ocations. whi.e 12 bte afe re^uirS Z Sdfess « o,' theTo^x 1 JStT" * «*- ^ ^ 64 * 32 
variable length and. in fact, the width of the aSeTs £S£S. add^SmS ** T" Can be ° f 

address can be defined within a fixed worH«w*hKw. s P ecmes me address format of the memory. Accoidingly. the 
the address and defines ^ 
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Table 10 



10 



15 



20 



Variable width addressing 








Data Width 
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1 
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a 


a 


a 
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a 


a ! 
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a 


a 


a 
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1 
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8 


0 
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16 
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0 
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32 


0 


0 


0 


0 


0 


1 


a 


a 


a 


a 


a 


a 



To allow indexing of the address, a portion of it can be substituted using the same method described P^Vj^ 
address substitution. The substitution portion (or field) of the address can be defined by a least s,gnrf.cant bit justrfied 
variable length field (The continuation marker -r : termination marker "0") that is superimposed on top of those shown 
in Table 1 0 Using an address of an eight bit word, as an example, Table 1 1 shows how to define thenumber of the leas 
significant bits to be substituted. The least significant bit added is the substitution indicator (marked w^. The general 
case of a Fixed width word for substitution is shown in Figure 2. 
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Address substitution 












Bits to be substituted 




A 
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a 
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a 
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0 




~0~ 


~cT 


~cT 
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a 
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"o~ 
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a 
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1 
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~0~ 


~o~ 




~T" 
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1 
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0 
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a 
~ 


~cT 
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~T~ 
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~cP 


~cT 
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a 




1 
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~P 
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0 
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~0~ 
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1 






~T~ 




" 7 




~o~~ 


~cT 
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IT 
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1 


1 
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~T~ 
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1 
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1 




1 


1 





in effect the substitute code is superimposed on top of the address that .s already coded From this coding rt can ,be 
seen that there are illegal addresses, most obviously 0x0000 and 0x3fff . In this case, a 0 must be in the bottom 9 bits 
to prevent substituting more than 8 bits and a "1" in the top 6 bits specifies an allowable access w.dth. If one of these 
errors is detected, the access is undefined, but the Register file contents will not be affected. 

In accordance with the present invention, the system for addressing and for accessing partial words in a reg.ster 

f ile is discussed below. . , , . . ... _ „„,,;_„. 

The conventional memory circuitry dictates that the memory must always be accessed at rt full w.dth. To achieve 
variable width accesses, a full (32 bit) width word is read. This full word is rotated until the partial word accessed .s jus- 
tified in the LSB The upper parts of the word are extended to the full width and then output. Extending may encompass 
padding with zeros or ones, sign extending, using the sign bit of a sign-magnitude number as the new MSB or any sim- 
ilar conventional method. Extending is dependent on the mode of operation. When the partial word is input to and writ- 
ten back into the memory, it is multiplexed back into the rotated full word, which is then rotated back and written into the 
array. Figure 3 shows these steps for the access of a 4 bit partial word in the fourth four bit word of the 32 bit word. 

To access or read partial words, such as the highlighted four bit word shown in row " 1 ■ 21 3 of Figure 3. the full width 
word must be rotated to place the partial word at the LSB. as shown in row "2" 214. As shown in row "3" 215. the four 
bit word is extended to create a full 32 bit word. This word can now be accessed. 
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) 



As shown in figure 3. a full width word that has been selected to be written back is truncated to the width of the nrin 
ma. partial word which is muitiplexed into the word shown in row 214. At the LSB SS^^? 
2UL The resulfcng word is rotated back in its original significance in the read word, this^shown V5 7 S full 
word can now be wntten back into the register file. S ™ ' 

The following list, therefore, summarizes the steps numbered in Figure 3: 



1 Full word read from memory; 

2. 12 bit rotated right puts partial word into the LSB; 

3. Extended to full word, then passed to output; 

4. The inputted partial word is multiplexed into rotated full word from (2V 
and " 



i. 1 £ bit "T^" 16,1 PU,S fU " W ° fd ba( * 10 ° ri9inal S,a,e ,0 be written - The to e accesses suggests the data flow 
structure of the memory tha . is shown in Figure 4. The numbers in the structure refer to the above teS and to Figure 

^J* 1 ? "i! m0ry add ' eSS muS ' be deCOded ,0 contro ' the above structure. It should be recognized that the MSB of anv 

£ S wo* adorJ 3 " h* i9ni,iCanCe WitH referenCe ,0 1,16 mem0r * The * «™ a ZZSES^JZ 

If™ f ^ address, whe.eas the rema.nder is a bit address. Therefore, the stage of decoding (in parallel I with The suh 

tion 2£ ^Sr? ,he , addr ^ s wid,h de,inin9 variable field by detec,in « the p° 8i «o" <^£K!^£i2£ 

^onmarker. Jh, S allows the address to be MSB justified (shining in zeros at the LSB). The top six bits bTSd 
directly as a 32 bit word row address of the memory. The bottom five bits can be used to directly conTo.b?th barrel shS 

£££?£S""h 1 XT ^ eXamPl6, ^ ° ri9ina ' 32 Wt addr6SS Wi " a,ways have a shi « o. oSo^thie 
uTr ll £ iSfr? t IS MSB jUSWied) - SimNarly - 3 1 6 bit address can hav * a shift of ObxOOOOiT 

arlnnSn.^. h ^ df6SS ^ have 3 Shm « ° bxxxxX ' ' e - 0 to 31 bi « shifts - ™« extender and input murtipiexer 

a inn«tl y a aC ? eS r S W ' d,h dGCOde to mask out the ™W niultipta the input words to a^ropSte 

significance. respect.vely. The block diagram of the decode is shown in Figure 5 It can be seen that thP H^l «r fh! 
two variable width fields for width and substitution can be done in parallel and ind^ndentfy ° *" 

as shoJTin Ihf h 3 inm n r amPle °r a *** " 0rd 1 3 bitS '° n9 ,0r addres s' n 9 variable width data and substitution 
«L u ° tW ° rOWS For these exam P'es. an eight bit word would have been addressed at lorati™ 
0b1 1 0lssss. where "ssss" is substituted from another address source. addressed at location 

Microcoda ble Stale Machine Structure 

In accordance with the present invention, the substitution into a memory address and the variable width accession of a 
shZTn r" 6 baen bf0U9ht ,09ether in •* implementation of a microcodable state machine the Z 
shown ,n Rgure 6. The structure is one of a state machine 218 providing control of an arithmetic cor^ bv wav of a 

° -nstruc^ons contiguous.y or a jump can occur from one instruSon to another tE^ESS^ £ 

form shown m F.gure 7. The substituted value comes from the Arithmetic core 219 as shown in Figures 6 a* * This 

Sf^rS^T " h, UmP ^ ^ miCrOCOde Pr09ramS - ™«* if a is -ade w?h 3 bits i^tuS 

heamZSc Sr. i ! T P ° SS ? COn, ' 9U0US l0Cati0n5 th3t be jumped 10 ' each de P end *™ °" the value from 
the arithmetic core. i.e.. it has so become a programmable jump. 

Arithmetic P,nre 

MmSEZ anTnnuf; f, Sr^" ^ * 8 """^ C-W a re 9 i6ter ,ile 221 - an Arithmetic and Logic 

To^uT^^- 3 and T 0l ^ 1 Ut J"* 1 ^ TheSS a '* connected via buses and multiplexers. 

As previously stated, these components, and the multiplexers defining their connections, are entirely controlled bv the 

:sr?^^^rs^^T? ,a 218The ALU 222 and the 223 and 224 a - 

^\^S^SZ 11 SSKSS"* width indexed accesses The to - ™° 

There are many advantages of using this method of addressing to the register file. First many locations in an annii 
cation do not need to be the full width of the memory (32 bits in this case). Whilst it will ™ ^efSon tne 0 ^eSn 
of the device to use a full width location, it is very wasteful of memory locations. Minimizing the mmbfr oTme^ToS 
lions w,.. mmimize the amount of space used by the memory and. therefore, minimize ^e capac^^dingS^- 
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ister file This maximizes the speed of the register file. Second, the indexing combined with the variable width of 
memory accessing allows the stepping through of locations of variable width. In the one bit case this allows an elegant 
implementation of long division and multiplication. 

In summary, therefore, there is described a procedure for addressing memory having the following steps: _(1) ipro- 
vidina a fixed width word having a predetermined fixed number of bits to be used for addressing var.able width data (2) 
de« nL the fixed width word with a width defining field and an address field providing the width defining field with at 
f^st on b!» oTerve as a termination marker; (3) defining the address field with a plurality of bits defining the address 
of the data- and (4) varying the size of bits in the address field in inverse relation to the s.ze of the vanable width data 
varying the number of bits in the width defining field in direct relation to the size of the variable width data and maintain- 
ing a fixed width word for addressing variable width data while varying the width of the width defining field and the 
address field In addition, a procedure for addressing memory having the following steps is described: (1) P^d.ng a 
fixed width word having a predetermined fixed number of bits to be used for addressing data: (2) def ming the fixed width 
wo* with an address field and a substitution field; (3) defining the address field with a plurality of bits defying the 
address of the data; (4) defining a variable width substitution field with at least one substitution bit; (5) the sutet.tut.on 
field has at least one bit to serve as a termination marker between the address field and the substitution f.e d; .and (6) 
using the substitution field to indicate substituted bits from a separate addressing source and maintaining a fixed wxtth 
word for addressing variable width data while inversely varying the width of the address field and the w.dth , of the sub- 
stitution field. In addition, a process for addressing variable width data in a memory is described as having the foHowing 
steps (1) providing a memory having words of predetermined width and composed of partial words; (2) rotating the par- 
tial word to be accessed to a least significant bit justification; (3) extending the remaining part of the word so that the 
accessed word will be recognized as the partial word; and (4) restoring the remaining part of the word and rotating the 
word until the partial word is restored to its original position. 

Detailed Description of the Invention for Transforming Data Using a Common Processing Block 

This present embodiment, in accordance with the present invention, relates to a method for the transformation of sig- 
nals from a frequency to a time representation, as well as a digital circuit arrangement for implementing the transforma- 

'° n It is a common goal in the area of telecommunications to increase both information content and transmission 
speed. Each communications medium, however, imposes a limitation on transmission speed, as does the hardware at 
the transmitting and receiving end that must process the transmitted signals. A telegraph wire ,s. for example. typ.cally 
a much faster medium for transmitting information than the mail is. even though it might be faster to type and read a 
mailed document than to tap out a telegraph key. 

The method of encoding transmitted information also limits the speed at which information can be conveyed^ 
long-winded telegraph message will, for example, take longer to convey lhan a succinct message with the same infer, 
mation content. The greatest transmission and reception speed can therefore be obtained by compressing the data to 
be transmitted as much as possible, and then, using a high-speed transmission medium, to process the data at both 
ends as fast as possible, which often means the reduction or elimination of •bottlenecks 1 in the system. 

One application in which it is essential to provide high-speed transmission of large amounts of data is in the field of 
digital television. Whereas conventional television systems use analog radio and electrical signals to control the lumi- 
nance and color of picture elements ( pixels') in lines displayed on a television screen, a digital television transmission 
system generates a digital representation of an image by conveying analog signals into binary 'numbers corresponding 
to luminance and color values for the pixels. Modern digital encoding schemes and hardware structures typical y enable 
much higher information transmission rates than do conventional analog transmission systems. As such digital televi- 
sions are able to achieve much higher resolution and much more life-like images than their conventional ana og coun- 
terparts. It is anticipated that digital television systems including soiled High-Definition TV (HDTV) systems w.h 
replace conventional analog television technology within the next decade in much of in the industrialized world. The 
conversion from analog to digital imaging, for both transmission and storage will. thus, be similar to the change-over 
from analog audio records to the now ubiquitous compact discs (CD's). 

In order to increase the general usefulness of digital image technology, standardized schemes for encoding digital 
images have been adopted. Once such standardized scheme is known as the JPEG standard and is used tar stall pic- 
tures. For moving pictures, there are at present two standards. MPEG and H.261. both of which carry out JPEGjke 
procedures on each of the sequential frames of the moving picture. To gain advantage over using JPEG repeatedly 
MPEG and H 261 operate on the differences between subsequent frames, taking advantage of the well-known fact that 
; the difference, that is. the movement between frames, is small. It. therefore, takes less time or space to transmit or store 
the information corresponding to the changes rather than to transmit or store equivalent still-picture information as it 
each frame in the sequence were completely unlike the frames closest to it in the sequence. 

For convenience, all the current standards operate by breaking an image or picture into tiles or blocks, each block 
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TEST h C^ 4 ,7" f — " an0 896 addL * «^^^«S5S£^^5KS 

Duteftr?CT n !\F h pF^r Wi T e ^ 0dS i 0r binary decimatio " ™* developed by Eong Gi Lee ('A New Algorithm to Com- 
method Recursive bina,y *dmdioa The Lee app-oa* is on,, S££Z £#* C ° S ' ne « ae " nes 8 
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As this very simple example illustrates, if floating-point arithmetic is required, one must either accept a toss of pre- 
w^^^.^^^^^^^eater when the methods for processing the video signals require sev- 

S m*^ ,runcation errors - typica,,y 9rea,er ,or mu,,ip,ica,ion than ,or 

addi A°much more efficient DCT/IDCT method and hardware structure would ensure that the numbers used in the 
mpfh J Tcoul J reorSen ed with a fixed decimal point, but in such a way that the full dynam,c range of each number 

in Lu^TsvsTem fruncation and rounding errors would either be eliminated or. at least, greatly reduced^ 
^^m^SS^ can handle four digits, no number greater than 99.99 were ever needed, and 
I ZZ? Zr ^hTdedmal ooint between the second and third places, then the presence of the dec.mal pant would 
r^T^L.^ arithmetic could be carried out just as if every number were an integer. 

Tn ^S^SoiwSS^ be just as clear as 12.30+4.56=1 6.86. since one would a.ways know that the 
wZSSZSZSm between the middle V and V. Alternatively, if numbers (constant <, rotherw.se) are 
s^X^o'XZZ that they all .all within the same range, each number in the range couidls ao be accu- 

• ?Zl Tfrnm diSrent sources In other words, certain architectures use a single multiplier to perform the mulfplica- 
.nputdata fromdrf ^^^T^j or ,d CT calcu , at ions. Although such "crossbar switching" may reduce the 
m .required , d.ffe ^^ ^^ "^farted mutti p, e xer structures must be included instead to select 

sou HHL ^IZ^ iTnS^- Additional .arge-scale multiplexers are also required to switch the large number 
sources to the ™ £ appropriate subsequent circuitry. Crossbar switch.ng or multiplexing ©. 

^e^ *"5 <> "* «~ - ^ 3 ^ S6miC0n ' 

S £ to deal with the case in which every bit is a'V, a row of adders must be P ro- 

words are 8 bits wide and that one wishes to mu,t iply single inputs by * An 
9 bit SentSon o the number 5 is 00000101 . In other words, digital multiplication by 5 requires only that the input 
12* £ o the Z two places (corresponding to multiplication by 4) and then added to ,ts up-shifted value. The 

£ ooSns S the coeflicients have M values of 0-, so they would not require any shifting or addrt.onal steps. 
01 X^^nT^eXit in this case, a mu.tip.ier capable of muKip.ying only by f h,e ^ 

■ .1 !hi«« «™i Tsinale adder in order to perform the multiplication (disregarding circuitry needed to handle carry 

bits) l!^^SiSt^^^^ ^ shmers and adderS ,0f 6aCh 01 the e,9 ? f CZLTers 
SSflh six of the£ would never need to be used. As the example illustrates, fixed coeff cents can simplify the multipliers 

sSmey alllThe destntrTo eliminate rows of adders that correspond to zeros in the coefficient, thus saving silicon 



40 

area. 



in an IDCT method in accordance with the present invention, a one-dimensional IDCT for each N-row and N-col- 
umn of N-by^ 1£!S££ S decimated and a 1 -D IDCT is performed separately on the N-2 even-numbered pixel input 

4S ^T^T^XST^rZX the JPEG standard. The two-dimensional .OCT resuU is then 
obtaineo by pTrfoVmlng two one-dimensional IDCT operations in sequence (with an intermediate reorder.ng-transpos,- 

^"Tn fcSnmon processing step, for N-8. a first pair of input values is passed without need for multiplication to output 
addl and s^3S Each of a second pair of input values is multiplied by each of two 

™£JS™^Tiv» scaled cosine values. No other multiplications and only one subtraction and one addition are 
£ ° r^nlVc^ The second pair I then added or differenced pairwise with the first pa.r of 

^Tp^^roc^ order odd input word is pre-mu.tiplied by the square root of two and 

the £d inoutToT aTsummed pairwise before processing in the common processing block In a post-common 
55 ^ corresponding to the processed odd input words are multiplied by predetermined 

con ?«rs^ 

of the odd ^resultant values from the even resultant values, and the N/2 low-order outputs are formed by simple addition 
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of the odd resultant values and the even resultant values 



45 



inmates « or ]£l of ^SS^SS^ ° ^ *" ^ the «*. «** 

sca.ed downward by a fact* ?^S5S^^'S^^ a ^P refe ^ and 
» several mu,,ip„cation steps tha, are r^-SSj toS^^'SS^ ^ ^""a e ' imina,eS 

. ues JSSSSS s^ " ^ ^ ^ 3 ~ "* 1 " D « •» output vai- 

are performed on input data worS A 's^^T^^Z ' T""' 3nd P° stcoI ™<>n processing calcu.aSons 
system latches; preferably. ,o seS. tim ^^^SSSK^SSl Si9 ^° ,0adin9 °' V3rious 

• to input .atcbes of the pre-common bio* to TT?^ ,npU WOrdS 

order output signals and to direct subtraction of the odd eihant vat- ZlT ' .f^ ,0 '° rm and ,a,ch low 

the high-order output signa.s and to s«,uen«XcoC^!S mu^eTs ^ * ^ and ' atCh 

P-^bS^^^^ 

order, but rather in an order enabling an efficient T^SSTS^i"^" 61 ^ 0r ^^ 

tro. 2SE££ ^^STS^SZ b J con,igur< S a L a pre:,09ic drcuit - with no c,ock - - 

tion. - - ° perat,on - as be other processing blocks, depending on the particular applica- 

includt^ -J^eo. Rathe, constant coefficient multipliers are 

preferred embodiment of the ,n^r^^ZT^ J ^ arithme,iC devices are inc)ud ed in the 

transformation of video £^^?£^££Z?» 3 and ** 'OCT 

1. Constant use of all costly arithmetic operations- 

answers; » oparaSons «. JSSS aZ ma «hl h""' ' ,me 'L^* 9 ' (Me be "" , > <* ** 
avoid dala, ana to a»ow 8^S^i^ll*S^ *"*■ "'^ ""' "*"°™*°<> » e '*" s *> <* <° 

4. One is able to generate results in a natural order; 

5. No costly, complex, crossbar switching is required- 

6. The architecture is able to support much faster operations- and 

7. The circuitry used to control the flow of data through the transform hardware can be small in area. 
Theoretical Backgroun d of the Invpntinn 

^^SLtS « £ conwner " 8 and *• " lhe *- «»>— 

rascal basis. ' aancrt.ng lo lha prasanl rnvenson, it is halpful to undaraland tha systems theo- 
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Separability of a Two-Dimensional inrrr 



55 2 W ° A '-' 



w n , 0n .o «■ 2/v J L 2/V J Equationl 
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where j,k=0,1...,N-1 and 

^LmizHg^rXe S^ve'oX transform, and the coefficients c(j). c(k) are known normalization factors. 
The expression for the corresponding inverse discrete cosine transform, that is for the IDCT. is as follows: 

x(m.n )= |££cO)C(/c).0-..)cos[«>s[^] Equation2 



where j.k=0 N-1 and 

T«e -o^T St^SS Imm (wh*» m*-m — — such as 

The lorwaro ^ is us« k , andard) into tneir frequency representation. The inverse DCT. as its 

SSCS o S Son^hafis me IDCT transforms the frequency values back into spatda. values. 
toZ e^sSn. Equation 2. (E2). note that the cosine functions each depend on only one of the summabon ind,- 



ces. 

The expression E2 can therefore be rewritten as: 



, 2 ^ / -\ r^mji^l^ r(k x yd k \ cos^ 2n+1)/C7l l Equation 3 

x(m,n) = ^^c(y^cos^^^J£c(/() ^(y.^)cos^ 2N j 

This is the equivalent of a first one-dimensional IDCT performed on the product of all terms that depend on k and n. 
2££ UfS ^^htforward standard data transposition by a second one-dimensional IDCT using as inputs the out- 
puts of the first IDCT operation. 

Definition of t he 1-D IDCT 

A 1 -dimensional N-point IDCT (where n is an even number) is defined by the following expression. 



X (K) - "£ c (n) • y (n) cosp£f±p] * = (0.1 AM} *,uation4 

n=0 



and whe're yln) are the n' input s^thTlnvelse transformation function and x(k) are its N outputs As in the 2-D case the 
tomula forSe DCT has the same structure under the summation sign, but with the normalization constant outs.de the 
summation sign and with the x and y vectors switching places in the equation. 

Resolution ot a 1-D IDCT 

As is shown above the 2-D IDCT can be calculated using a sequence of 1-D IDCT operations separated by a trans- 
Jose t^corSance to one embodiment, each of these 1 -D operations is in turn, broken down into sub-procedures that 
?e then e^lorted to reduce even further the required size and complexity of the semiconductor implementat.on. 

Normalizatio n nf Coefficients 

As is discussed above an important design goal for IDCT hardware is the reduction of the required number of multipli- 
ed ^sttetclied in the circuitry. Most methods for calculating the DCT of IDCT therefore, attempt to reduce 
thl ntr^r ormuiiiplications needed. According to this embodiment, however, all the input values 
scaled upward by a factor of the square root of two. In other words, using the method according to this embodiment of 
Z ^pfesentlnvention the right-hand side of the IDCT expression (E) is deliberately multiplied by the square root of twa 
According to his embodiment, two 1-D IDCT operations are performed in series (with an intermediate transpose) 
to iSSS^icr result. Each of these 1 -D operations includes a multiplication by the same square root of two 
actor ^ Since the intermediate transposition invofves no scaling, the result of two multiplications by the square root of 
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10 



Separation of the 1-D IDCT into Hioh and L ow-Order Qufeyts 



/5 



20 



ouTSI^TZ tTtf ? ** ** ^ ' 0, "• , » <,e, °* ,,,5 < l »°- 1 ) and N« high orte, 



Low order outputs: 



25 



AM 



~ 0 v ' L 2 N J Equation 5 



3£> where k={0,1.„ (N/2-1)} ; and 

c (n) =1/V2 forn^O; otherwise 1 

High-order outputs: 



55 



AM 



x (*) = *(AM-*~)= £ y(n)M,"cos[!L^D] Equa , on6 

40 where k={N (N/ 2+ i)}~ >k - ={0 ,i (f^.,)} 

(Since c(n)=1 for all high-order terms. c(n) is not included in this expression) 

45 

Separation of thp 1-n idct mm F^n an d ohh i npi .^ 

b^eoTnput^ ST*^ a ' S ° ^ MpMl8d Wo Sums: one * * e 

c„ i !,!r,u? ( .1 . V y(4),and ^ 6 > and one for the odd-numbered inputs (for n=8 y(1) vf3) V (S) and „m 

^ Part ' al 51,01 ^ ^ ^ «• "W r^resen/the ^iSt^SSt 

Thus: 

55 ^ $-1 

a (*> = £ c (2„> y (2„, cos { ^ln ] £ c (2fl) y (2n) ^ [SgS ^ ? 

a=0 2 (2) 



20 



Where k={0.1,...,(N/2-1)} ; and 
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,x r n(2j(±1) (2^1) , 



n=0 



Equation 8. 



where k=(0.1,... .(N/2-1)}. miow 
For N=8. observe that the sums in E7 and E8 both are taken over n={0,1 ,2,3}. 

10 Now recall the known cosine identity: 

2.cosA.cosB=COS(A+B)+cos(A-B). 

and set A = n (2/c+1)/2A/ and S = n (2/c+1)(2A/+1)/2/V. 
is One can then multiply both sides of the expression E8 by: 

2. cos A = 1/ {2 cos[ n (2/c+1)/2 A/] } =Ck. 
Note that, since Ck does not depend on the summation index n, it can be moved ^hin the^summa^ 



20 



25 



ther .by Son . that y(-1)-0. and note that the cosine function for the input y(7) is equal to zero. The expression for 
h(k) can then be rewritten in the following form: 



/>(*)- 



2 cos 



— Y [y (2n+1 + y (2n-1)]cos[-i — 



Equation 9. 



30 



NoTS Kpuls 1 imply that in calcu.ating h(k). the odd input terms are paired to form N/2 

paired inputs p(n)=[y(2n+1)=y(2n-1)J . 

For N=8 the values of p(n) are as follows: 



35 



40 



n 


P(n) 


0 


y(-1) + Y(1) = Y(1) Y(-1) - 0 by definition 


1 


y0) + y(3) 


2 


y(3) + y(5) 


3 


y(5) + y(7) 



45 Expression E9 for h(k) can then be represented by the following: 



h(k)=C k £ p(n)cos[ * N ' ] 



50 



n=0 



2 (5) 



Equation 10. 



55 



Ob^rve^i term under the summation sign is the same for both g(k) and h(k) 

the structure off a 1 -D IDCT (compared with expression E5). The result of the IDCT for the odd k terms, that is. for h(k). 
however is multiplied by the factor 



Ck=1/{2cos[n(2/c+1)/2/v\ 
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Now introduce the following identities: 



15 



20 



25 



yn=y{n); 

c7=cos(n8); 

c2=cos(2rc8)=cos(7t4)=1W2; 

c3=cos(3n8); 

tf7 = 1[2.cos(/t1610)]; 

ctt=1[2.cos(3rc/16)]; 

tf5=1[2.cos(5n/16)]; and 

tf7=1/[2.cos(97n/16)]. 
Further introduce scaled cosine coeefficients as follows: 

c7s=V2.cos(ti/8); 
c3s=V2.cos(3rc8); 
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35 



0(O)=VV2.y 0 + y2d + y4c2 + y6c3 = 1A/2.(y + y 2 .C/s + y4 + y 6 .c 3 s) 
g(1)=1/V2.y 0 + y2 c3 - y4c2 - y6c1 = 1/V2. (yO + y2.c3s - y4 - yfi.cfe) + yfi.cte) 
0(3) = 1/V2 .yO - y2d + y4c2 - y6C3 = 1/V2. (yO - y2.cls + y4 - y 6 .c3s) 



and 



/J(0) = C7T [y1 + (yr + y 3 ) d +{y3 + y5) c2 + (y5 + y7) c3) = 
40 dlNZ.{<2.y1 + (y1 + y 3 ). cfe + (y3 + y 5 ) + (y 5 + y7). C 3s) 

h{1) = d3.[y1 + (y1 + y3) c3 - (y 3 + y5)c2 + (y5 + y7) d) = 
d3N2. {V2.yr + (yr + y3) c3s - (y3 + y5) - (y5 + y7) ds) 

45 h(Z)=dS.{y1 +y3)c3-(y3 + yS)c2 + {y5 + y7)c1} = 

d5HZ. {V2.yj - (yr + y 3 ). C 3ys - (y3 + + y 5 ) - (y5 + y7) c»s) 

50 h(3) = ^ I y • (y* + ^) d + (y 3 + y5) c2 - (y5 + y7) c3 J = </7/V2. 

{V2.yr - (yr + y 3 ). crs + (y 3 + y5) - (y5 + y7). c3s } 

°T pres V invention - a " va,ues are «— u ^ d * - »** - ^ ^ 

5(0) = y0 + y? • cfe + y4 + y 3 * c3s Epuation , , 
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0(1) = yO + y2 * c3s - y4 - y6* c3s 
g(2) = yO - y2* c3s - y4 + y6 * els 
and 0(3) = yO - y2 * els + y4 - y6 * c3s 



10 



15 



20 



h(0) = dl [V2 * y7 + (y1 + y3) * cfs + (y3 + y5) + (y5 + y7) * c3s] Equation 12. 

/7(1) = d3 [V2 * y7 + (y* + y3) * c3s - (y3 + y5) - (y5 + y7) ' ds) 

h(2) = d5 [V2 * y) - (yl + y3) * c3s • {y3 + y5) + (y5 + y7) * ds) 

h(3) = 67 [V2 * yt - (yt + y3) * ds + (y3 + y5) - (y5 + y7) * c3s) 

Notice that since c2 = cos (n /4) = 1 N2 . multiplication by V2 gives a scaled c2 value=1. By scaling the expressions 
Z ^^r^^d scaling of the values of the video absolute and frequency values) according to his embodi- 
ment ^^s 1 e^efore^ossible to eliminate the need to multiply and c3s. both of which are constant coefficients . so that 
genial Siliiy mu.tipiiers are not needed. This, in turn, eliminates the need for the correspond.ng hardware murtipher ,n 
the semiconductor implementation of the IDCT operations. 

The similarity in structure of g(k) and h(k) can be illustrated by expressing these sets of equations ,n matrix form. 
Let C be the 4 x 4 cosine coefficient matrix defined as follows: 



25 



c = 



1 ds 1 c3s 
1 c3s-1 -cfs 1 
1 -c3s-1 ds 

1 -C1S 1 -C3S 



Equation 13. 
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9(0) 

'(1) 
9(2) 
0(3) 



AO 



h(0) 

fi(1) 
h(2) 

h{3) 



- c 



= D * C * 



yo 

y2 
yd 

ye 



-42 * y1 
y1 + y3 
y3 + y5 
y5 + y7 



Equation 14 



Equation 15. 



where D= diag[d1 . d3. d5. d7]=the 4 x 4 matrix with d1 . d3. d5. and d7 along the diagonal *^J™ 0 ^* e ™™* 

eoual to zero As E1 4 and E1 5 show, the procedures for operating on even-numbered .nputs to get g(k) and for operat 
« tog on the odd-numbered inputs to get h(k) both have the common step of multiplication by the _cos.ne coeff.c.ent matnx 

C To get h(k). however, the inputs mustfirst be pairw.se summed (recalling that y(-1)=0 by def.nrt.on). y(1) must be pre- 

multiplied by 2. and the result of the multiplication by C must be multiplied by D. 

As the expressions above a.so indicate, the N-point. 1 -D IDCT (see E4) can also be spirt m o the two N/2-po.rt L I- 

D IDCTs each involving common core operations (under the summation sign) on the N/2 odd (grouped) andthe N/2 
so even input values. The expressions above yield the following simple structure for the IDCT as implemented .n tn.s 

embodiment: 



55 



Low-order outputs for (N=8. outputs k={0.1 .2.3)): 



y(k)=g(k)+h(k) 



Equation 16. 
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High-order outputs (for N=8, outputs k={4,5,6,7}): 
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X**N-M*0<lO*W Equation 17. 

Standards of Arrnrary 

by-8 Blocks containing random integers be generated These blocks are thpn nrr ^ inn-r* <T !?, * 

«?f a n2,?H r i?°T ? nd ?*u ar ? * ,0Be °' ,he ' nSti,Ute of E,ectrical and Ele <*°™ Engineers {'IEEE') in 'IEEE Draft 
Standard Speaf.cat.on for the Implementation of 8 by 8 Discrete Cosine Transform- Pi imvncT i . » ™ ? 

Annex A of 8 by 8 Inverse Discrete Cosine Transform', ISO SSSSinS^ik If" 

tially identical to the CCITT standard described above. °° mm,t,ee Draft CD 1 1 1 72 " 2 standards are essen- 

Hardware Implementation 

s~ : '^ 

the i!2 ^rS^^f"" 1 ^""* ""* P " e ' 5 a ' e "* »i«nc,les 230 In the data pah For example 

were tor the constants at . c3. d5. 07 (see exptess.00 E12); a second post-contmon Mock. P0ST02 23S. that sums 
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the go to g3 terms and the hO to h3 terms for the low order outputs, and forms the difference of the > gO to g3 terms and 
the no to h3 terms for the high-order outputs (See expressions E17 and El 7); and ^common block. CBLK 232 .s 
ncluded in both the even and odd data paths. In the processing circuitry accord.ng to the embodiment of the present 
invention, the common operations performed on the odd and even numbered inputs are earned out by a angle struc- 
ture rather than duplicated structure as illustrated in FIG 9. 

To understand the method of operation and the advantages of certain digital structures used in the embodiment, it 
is helpful to understand what "carry bits". As a simple example, note that the addition of two binary, numbers is such hat 
1*1-0 with a carry of "1". which must be added into the next higher order bit to produce the correct result 10 (the 
binary'representation of the decimal number "2"). In other words. 01 * 01 = 00 (the "sum" without carry) * 10 (the carry 
word) adding the "sum" to the "carry word"' one gets the correct answer 00 + 1 0 = 1 0. 

As a decimal example, assume that one needs to add the numbers '436' and '825'. The common procedure for add- 
ing two numbers by hand typically proceeds as follows: 

1. Units '6' plus S is T with a carry of T into the 'tens' position - 

' Sum 1. Canyin 0, Carry-Out: 0. 

2. Tens: '3' plus 2 is 5\ plus the 'V carried from the preceding step, gives '6' with no carry - 

Sum 5. Carry-In 0. Carry-Out:0. _ 

3. Hundreds: 4 plus 8 .s Z with a carry of 1 into the thousands, but with no carry to be added in from the previous 

step; 

Sum: 2. Carry In ). Carry-Out:1 

4. Thousands: 0 plus 0\ plus the T carried from the hundreds gives. T 

Sum: 0. Carry- In 1 . Carry-Out: 0. 

The answer 1261- is thus, formed by adding the carry-in sum for each position to the sum for the same position, 
with the carry-in to each position being the carryout of the adjacent lower-order position. (Note that thismnpl. that the 
carry-in to the lowest order position is always a '0'). The problem, of course, is that one must wart to add the 4 and 8 
in the hundreds place until one knows whether there will be a carry-in from the tens place. This illustrates a ripple 
adder", which operates essentially in this way. A ripple adder, thus, achieves a final' answer without needmg extra stor- 
aae elements but it is slower than some other designs. . 

One such alternative design is known as 'carry-save', in which the sum of two numbers for each position is formed 
by storing a partial sum or result word (in this example. 0251) and the carry values in a different word (here 1010). The 
full answer is then obtained by 'resolving' the sum and carry words in a following addition step. thus. 0251+1010 = 
1261. Note that one can perform the addition for every position at the same time, without having to wart to determine 
whether a carry word can be added to the partial result at any time as long as it is saved. 

Since the resolving operations typically require the largest proportion of the time required in each calculation stage, 
speeding up these operations has a significant effect on the overall operating speed while requiring only a relativey 
small increase in the size of the transform. Carry-save multipliers, therefore, are usually faster than those that use ripple 
adders in each row. However, this gain in time comes at the cost of greater complexity, since the carry word for each 
addition in the multiplier must be either stored or passed down to the next addition. Furthermore in order to obtain the 
final product of a multiplication, the final partial sum and final carry word will have to be resolved, normally by addition 
in a ripple adder. Note, however, that only one ripple adder will be needed, so that the time savings are norma ly pro- 
portional to the size of the multiplication that must be performed. Furthermore, note that a carry word may be treated 
as any other nurrtoer to be added in and as long as it is added in at some time before the final multiplication answer ,s 

needed, the actual addition can be delayed. ... ^ . ... ^ 

In this embodiment of the present invention, this possibility of delaying resolution is used to simplify the design and 
to increase the throughout of the IDCT circuitry. Also, certain bits of preselected carry words are. optionally and delib- 
erately forced to predetermined values before resolution in order to provide greater expected accuracy of the IDO I 
result based on a statistical analysis of test runs of the invention on standard test data sets. 

FIG 1 0 is a block diagram that illustrates a preferred structure, in accordance with the present invention. In this pre- 
ferred embodiment of the present invention, the even and odd numbered inputs are time-multiplexed and are processed 
separately in the common block CBLK 232. The inputs may be processed in either order. 

In FIG 1 0. the notation Y[1 .0]. Y[5.4]. Y[3.2] and Y[7.6) is used to indicate that the odd numbered inputs Y1 . Y3, _Y5. 
Y7 preferably pass through the calculation circuitry first, followed by the even numbered inputs Y0. Y2. Y4. Y6. This 
order is not essential to the present embodiment: nonetheless, as is explained below, certain downstream arithmetic 
operations are performed only on the odd numbered inpuls. and by entering the odd numbered input values first, these 
downstream operations can be processing at the same time that arithmetic operations common to all inputs are per- 
formed upstream on the even numbered inputs. This reduces the time that several arithmetic devices would otherwise 
remain idle. 



25 



EP 0 891 094 A2 



45 



SO 



55 



As shown in FIG 10. adders and subtracters are inditecad by circles either a V fadriprt o-w • • - » ,«„ 
which is an adder with one comDlementinn in™ it nr •+• /— i. ~L . (.adder) 235, - (subtracter) 236 

tion and subtraction »7SKSS2?if TJ" 9 add u er/sub,ract ° r - * able to switch between addi- 

•n other words, the first addons tSSSSSL in £ i«^^S^S^i , l!! , ' ■*«?***^ 
LT=eS^ 

™^ J* « ~ OH and h(k) values £££££ 

input Y! 2 must also be multiplied by the sZ^ootof Zl^lT^T* ^ SUCh pairin£ »- but 

the common block CBLK 232 l7e pre^X tSL ? rec^tSlT^ ?2 ^ ^ Va ' UeS ^ PreS6nted to 
5 C10 r«vi r-?9 anH (-^e « u P recommon DlocK PR EC 231. therefore, includes a 2-input multiplexing Cmuxl latch 

resolving spuare^oot^o SS^SSSKK?^ SiSTtST I* 1 ^ ^ 

- generate £e 2S^SS5£r t 1 ' ^ °f ° 7 ^ their oSputs ' lha < * 

from the multipliers have th 2 ££ ^tf!iS^!T n,, l' C r P ' ete ^ ™ S 6nSUreS that *• 

no, b^mptme"^ J^IST ** ^ *~ 3 ,U "- SC * le a ^^ubtraCor need 

CB Jk S?^^ e a S Th*" ° U *"? ,f0m ^ 3dderS and sub,ra « ors in *• common blocK 

prefe^^^ «J-* «he odd-numbered inputs in th* 

first. Supervisory control circurtry (notThTwn in F^G To)' 132 2? 2 f in P ute are •» «- processed 

block PREC. and selects the lower inputs Slewed i« in PI S S)^f ,h^» ^ f /"^ ™ * * ,he P^" 1 ™" 
then stores the paired values d0 to o3 (111 f^?^ UmJ l f muB, P lex,n 9 etches C10. C54. C32. C76 which 

are then activated to ^Z^g^SS ™ ™* ^ 1 ' * - ™ 

.atches. Since the even-nun^eXn^Te u^to form £ vSl S oO t^Tr" 1 ^ ^ M ,0 tHeSe 
opens the latches LgO to Lg3 in the Z«^£5£££& to So e ft K, ^ *° 

X6 SZSttPS ^ ? h ' UeS ^ the P° st - comm on Wod, POSTC 233 ouSl the high-order sianals X7 

S* 5 £ x? y r 9 reS0,Vi " 9 3dder subfractors «o «he subtraction mode. The ^ ordfouSut sSnS S* 
X2. XI and XO are then generated by switching the resdving addersfcubtractors to the addition ^SSSSSSS- 
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put data can be presented in an arbitrary order, including natural order. 

The preferred multiplexed implementation, in accordance with the present invention, is illustrated in greatly simpli- 
fied schematic form in FIG 10, performs the same calculations as the non-multiplexed structure illustrated in FIG 9. The 
number of adders, subtracters and multipliers in the common block CBLK 232 is, however, cut in half and the use of 
c dummy adder/subtractors 240 further reduces the complexity of the costly arithmetic circuitry. 

FIG 1 1 illustrates the main components and data lines of an actual implementation of the IDCT circuit according to 
the embodiment of the present invention. The main components include the precommon block circuit PREC 231, the 
common block circuit CBLK 232, and the post-common block POSTC 233. The system also includes a controller CNTL 
241 that either directly or indirectly applies input, timing and control signals to the precommon block PREC 231 and 

w post-common block POSTC 233. ^ ^_ ( v __ 

In the preferred embodiment of the present invention, the input and output signals (YO to Y7 and XO to X7, respec- 
tively) are 22 bits wide. Tests have indicated that this is the minimum width that is possible which still yields acceptable 
accuracy as measured by existing industry standards. As is explained in greater detail below, this minimum width in 
achieved in part by deliberately forcing certain carry words in selected arithmetic devices to be either a T or a 0'. This 

15 bit manipulation, corresponding to an adjustment of certain data words, is carried out as the result of a statistical anal- 
ysis of the results of the IDCT system, in accordance with the present invention, to the after using the IDCT transforma- 
tion of known input test data. By forcing certain bits to predetermined values, it was discovered that the effects of 
rounding and truncation errors could be reduced, so that the spatial output data from the IDCT system could be made 
to deviate less from the known 'correct* spatial data. The present invention is equally applicable, however, to other data 

20 word lengths since the components used in the circuit according to the present embodiment can all be adapted to dif- 
ferent bus widths using known methods. _____ 
Although all four inputs that are processed together could be input simultaneously to the pre-common block PREC 
along 88 parallel conductors (4 x 22), pixel words are typically converted one at a time from the transmission data. 
According to the present embodiment, input data words are, therefore, preferably all conveyed serially over a single 22 

25 bit input bus and each input word is sequentially latched at the proper input point in the data path. As shown in FIG 1 1 . 
the 22 bit input data bus is labelled TJN[21 :0] 242. 

In the Figures and in the discussion below, the widths of multiple-bit signals are indicated in brackets with the high- 
order bit to the left of a colon':' and the least significant bit (LSB) to the right of the colon. For example, the input signal 
T IN[21 0] 242 is 22 bits wide, with the bits being numbered from 0 to 21. A single bit is identified as a single number 

30 within square brackets, thus, TJN[1] indicates the next to least significant bit of the signal TJN. 

The following control signals are used to control the operation of the precommon block PREC 231 in the preferred 

embodiment of the present invention. 

IN_CLK OUT_CLK: The system, in accordance with the present invention, preferably uses a non-overlapping two 
phase clock' The signals IN_CLK and OUTjCLK are accordingly columns of latches that hold the values of input, inter- 

35 mediate, and output signals. . 41 

LATCH 10 LATCH 54, LATCH32, LATCH76: Preferably, one 22-bit word is input to the system at a time. On the other 
hand four input signals are processed at a time. Each input signal must, therefore, be latched at its appropriate place 
in the architecture before being processed with three other input words. These latch signals are used to enable the 
respective input latches. The signal LATCH54, for example, is first used to latch input signal Y5 and later to latch input 

40 signal Y4, which enters the pre-common block PREC 231 at the same point as the input signal Y5 (see FIG 10) but dur- 
ing a subsequent processing stage. DD _. O0 . 

LATCH- Once the four even or odd-numbered input signals are latched into the pre-common block PREC 231 . they 
are preferably shifted at the same time to a subsequent column of latches. The signal LATCH is used to enable a sec- 
ond column of input latches that hold the four input values to be operated on by the arithmetic devices in the pre-com- 

45 mon block PREC 231. 

SEL BYP SEL_P: As FIG 10 illustrates, the even-numbered input signals that are latched into the latches oio. 
C54 C32 and C76 should be those that bypass the adders and the square root of two resolving multiplier. The odd- 
numbered input signals, however, must first be paired to form the paired inputs p(n). and the signal Y1 must be multi- 
plied by the square root of two. The control signal SEL_P is activated in order to select the paired input signals. Hence, 
so these signals are used to control gates that act as multiplexers to let the correct signals pass to the output latches of 
the precommon block PREC 231. 

As discussed previously, not having to arrange the inputs in strictly ascending order leads to a simplified butterfly 
bus structure with high interconnection efficiency. As also described, the odd inputs are preferably applied as a group 
to the pre-common block first, followed by the even-numbered inputs, but any order may be used within each odd or 
55 even group, i.e.. any order of inputs may be used, however, suitable latch arrangements as separately provided to proc- 
ess the odd-numbered inputs, or at least are provided in separate regions of the circuit. 

The supervisory control circuitry also generates timing and control signals for the post-common block POSTC 233. 
These control signals are as follows: 
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the m ^n^ir ,he r mon b,ock cblk ^ after - 

d1. d3. d7. d5. respectively, in ^^o^S^S STS.^ST! * ^ multip,ierc ' 
that hold the gO to g3 values as weM as to 1 £2 L f J f ™* S ' 9nal EN - BH is used to enab| e latches 

5 plied in the coefficient mu!Ser S * ' a,CheS h °' d the h0 10 h3 va,ues after have been muHf- 

feren^S) "iS^f ^iKS^^ 3 bank ° f reSoVin9 ^rs/subtractors that sum and * 
the r add^ ADD. SUB are used to set 

- MU dS25?K SS5 MUX SSS?, ^ reS ? S fr0m Subtracters, 

data from the system is preferably S£S2 ^^^Z^^ZI^T ETT 1,16 

bJS ^T^J^TESJ^ 23 to 3re * atChed ,0 ^ 4 ™Sto the con™ 
four 22-bit data words 0^21:0, 0 f^TalS? SS^JtHT 1 b,0 * h PREC " P' 65 ^ 35 *» 
IN[2J. respectively, to the common block CBLK 232 1 1 ° ] ' bSCOme the inpUt Si9na,s IN l°J- 

mon block POSTC 233 as 5£S5 CO^" TcS^S&t? - " ^ ° f the 

-nkS^SKS^r^^^ ^ *£2 for * — «oc k CBLK. Because of the 
as pure logic operations with no to ctoT?^„o V, 71 ^ ° f * e SyS,em ' S operations can b « Performed 
» device. One should ale note t£at insert in 7 C °. ? * 9 " a,S ™» ,urther reduces 1,16 «™P'«*ity of the 

a., needed arithmetic o^erat^ 

arranged to operate without clock timing or control signals ' ° STC 233 a,so be 

uresXltoiatton SfJaE * SifSIZSZS' *V ° 23 1 °' P '- nf h "" ,faa •* and toll ™"° Fig- 
■ of the signers bus width. Sfef *3 ihe s^ectS bits' a b ^ TEE? ^ t in,e96rS Wimin the ™ 9e 

parallel over the same bus. with the most Mfarthih LqrVk T ' ? S ' 9nalS S1 ' S2 SM are ^nsferred in 
significant bits (LSBs) being J» wS ? SfSS 2 ^SZZ-??*" ' 3 ' ° f 816 si9 " al Sl ■ and the 
entire or partial multi-bit wo?d" mj aSb tranSil^ 5 fJ* b " S , d ° " 0t haVS *° be individual bi,s - but ratb <* 
•n the Figures the symbol sX^X ^ 

from thls"^ iS S ^!"f R2MUL. The 'save'; or 'unresolv*, sum' output 

plier R2MUL is ^ «SS^^SS^ *" OUtput ,rom the ™«- 

resolving adder MSA. (Recall thai la X ^STi o^ °T *" bUS t0 the t> ' input of a 
this is accomplished before being appl ,^ to tof J ^ "SI*** 21 bte ° f 016 Save <**»*• howwer - 

notation GND M5Sf20on n ZS;,? w P 6 reSOlV ' ng adder M5A " 71115 is indicated in FIG 12 by the 
be a 0- by tying it to ground GND ™~ K ** COrrespondin 9 to MSB input to the adder MSA is forced to 

cati^ n S££S5?X£: ^RSLt? f "I ^ .T ' sum '' observe that " the partiaI sum of a 

fore, extends to n + .places Zh 7£f£Z£ SIT I * partial Sum " ^ «ord. there- 

there is nothing be^SnC^a^ m ZZ^s ^Zt'^T ^ «*~ 

to a resolving binary adder, care must be teken tOB^ ZV^^f^TSi ° W ° rdS are used as inpute 

IN54L. IN32L. AND N7BL ££££££ a2h fe ^N^.' S '° ^ ^ ,atCh6S ,N10L - 

spending latch selection signal wSSwSS s f i9na ' ,N -CLK and the corre- 

lated into their respective input latched ■ r ^our oeriods ofth^i?? t ^ 7,16 ,OUr inpUlS ^ theretor * be 
bling signals LATCH10 LATCHS4 TS5« S ' 9nal by s^" 6 "* 31 activation of the latch ena- 

different phase, to JitSSl^SSlS iK^S^ 1 ^!^ * '° W ( ° r °" 9 

An example of the timing of the latches, in SSCSr^S^iTS 
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four input signals are latched in the preferred order, they are passed to a second bank of latches LI 0L, L54L L32L, 
^6L^^se«nd bank of latches are enabled when the signals OUT.CLK and LATCH are h,gh. This signal t,m,ng 



is also illustrated in FIG 13. 
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Note that the system of the present invention does not have to delay receipt of all eight input words. Once all the 
even !2 «£ in P ut are receded and latched in IN10L. IN54L. IN32L and L76L. this frees the In latches, wh,ch can 
then begin to receive the other four input signals without delay at the next rising edge of "N-CLK. 

Th?2-digit suffix notation (1 0. 54. 32. 76] used for the various components .llustrated ,n the F.gures .ndicates that 
odd-numbered signals are processed first followed by the even-numbered signals on a subsequent pass trough the 
sTrucTe. As is mentioned above, this order is not required by the present invent.cn. and .t w,ll be appreaated by one 
of ordinary skill in the art that additional orders may be used. .... . L76L th6 

Once the four input signals are latched in proper order in the second set of latches L10L. L54L. L32L. L76L he 
corrSponlg vies are either passed as inputs to output latches C10L. C54L. C32L and C76L on activation of the 
S eSbvoass signal SEL BYP or they are passed as paired and multiplied inputs to the same output latches upon 
S 3 *e Set p signal SEL_P. In other words, ail signals are passed, both directly and ind,rect.y. via anthmetic 
devSL to the output latches C10L. C54L. C32L. C76L of the pre-common block PREC 231 . The proper values how- 
eve are Voided into these latches by activation of the "select bypass' signal SEL_BYP (for even-numbered inputs Y0. 
5 : Y? zSyS or the "select p" signal SEL-P (for the odd-numbered inputs Y1 . Y3. Y5 and Y7). As w,l be appreciated 
by one'of ordinary skill in the art. the desired timing and order of these and other control signals is eas.ly accomplished 
in a known manner by proper configuration and/or [micro-] programming of the controller CNTL 241 . 

T^ruppernist input^alue at ?he output of latch L10L is passed first to the square root of two-muKipher R2MUL 
and then to the resolving adder MSA as indicated. The output from the resolving adder MSA is shc^n as an equiva en 
of the resolved multiplication of the output from the latch L10L by the square root of twa The outputs from the K>ther 
Svee latches L54L L32L. L76L are also transferred to corresponding output latches C54L. C32L and C76L. respec- 
bth dTreciy via 2 2 - b it .atch buses LCH54[21 :0]. LCH32[21 :0] LCH76[21 :0] and indirectly to the output latches v,a 

racni\/ino adders P2A P1A and P3A, respectively. ^ , 

n ^ent Mention, each resolving adder P2A. P1A. P3A has two inputs v and "b". adde. P2A the one 
inou is received from the latch L32L. and the other input is received from the latch L54L For input values YS (latched 
n L54L) Z Y?( SchSln L32L). the output from the adder P2A will, therefore, be equal to YS + Y3. which as .s shown 
above s equal to p(2). Hence, the adders "pair" the odd-numbered inputs to form the paired input values ; p(1). p(2) and 
5£« cSrse Z even-numbered input signals latched in L54L. L32L. and L76L will also pass through ^ the ^resolving • 
addersP2A P1A and P3A. respectively, however, the resulting p "values" will not be passed to the output latches C54L. 
C32L and C76L because the "select p" signal SEL_P will not be activated tor J» e ™ u ^ er *!^ . . „ .. c|ock 
The values that are latched in the output latches C10L. C54L. C32L and C76L upon act.vat.on of the input clock, 
signal IN CLK will therefore be equal to either the even-numbered inputs YO. Y2. Y4. Y6 or the pa.red input values PO . 
P?P2 pf tor the odd-numberedVnputs. One shouk. recall that the input Y(1) is "paired" with the value "(-l^which «s 
assumed to be zero. As illustrated in FIG 1 2. this assumption is implemented by not adding anything to the value Y1 . 
Instead Y1 is only multiplied by the square root of two as is shown in FIGS. 9 and 1 0. • 
fIg 14 Urates the prefen-ed architecture of the common block CBLK 232. in accordance with the present .nven- 
tion Because of the various multiplications and additions in the different system blocks, it is necessary or advantageous 
to scale down the input values to the common block before performing the various calculations. This ensures a un,form 
poSnSe deS point (which is implied for integer arithmetic) for corresponding inputs to the various anthmetic 

d6V ' /S^dTngfy! theTnput values IN0[21 :0] AND IN1[21 :0] are accordingly scaled down by a factor of four, which corre- 
sponds in digtal arithmetic to a right shift of two bits. In order to preserve the sign of the number (keep positive values 
S and negative values negative) in binary representation, the most significant bit (MSB) must then be replicated 
in the two most significant bits of the resulting right-shifted word; this processis known as "sign e^nsion ^Hence the 
input value INO is downshifted by two bits with sign extension to form the sh.fted input value indicated as INJ21 ]. IN0[21] . 
IN0[21 2] The input value IN1[21 :0] is similarly sign-extended two places. The input IN2 is also sh.fted and extended to 
formlN2f21l IN2I21-11. These one-position shifts correspond to truncated division by a factor of two. 

As shown in FIG 10 the input IN2. IN3 are those which must be multiplied by the scaled coefficients ds and cSs^ 
Each input IN3 and IN2 must be multiplied by each of the scaled coeff icients. As FIG 14 illustrates this is • 'np'^ented 
by "he four constant-coefficient carry-save multipliers MULC 1S. MULNC1S. MULC3S3. and MULC2S2. One should 
note that the bottom multiplier for IN2 is an inverting multiplier MULCIS. that is. its output corresponds to the negative 
of the value of the input multiplied by the constant C1 S. Therefore, the value latched in C76 is subtracted from he value 
latched in C32 (after multiplication by C3S). By providing the inverting multiplier MULNC1 S. subtraction .s implemented 
by adding the negative ol the corresponding value, which is equivalent to forming a difference. This allows the use of 
identical circuitry for the subsequent adders, while allowing a non-inverting multiplier may be used with a following sub- 



tractor. 
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MULCpS S p e « 0 ° dlmen, 1 °I the preSent invention - fou ' cosine coefficient multipliers MULC1S MULNC1S 
J In accordance with the present invention, the multipliers for MULC1S MULNC1S MULVM and mi ii r,o, 

added bier in the aaS? ca> C °" nected as ,he w "W °l the same adder. In olhet words. IW and INI are 



^^5S5T*r^ ^ ^ Sdder/SUbtraCtorS <*W» «o form the SHSfSS 
Two advantages of the IDCT circuitry according to the embodiment of the present invention can be seen in FIG 1 4. 
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First no control or timing signals are required for the common block CBLK 232. Rather, the input signals to the common 
block are already processed in such a way they can be applied immediately to the pure-logic arithmetic devise in the 
common block 232. Second, by proper scaling of the data words, integer arithmetic can be used throughout (or. at least, 
decimal point for all values will be fixed). This avoids the complexity and slowness of floating-point devices, with no 
unacceptable sacrifice of precision. 

Yet another advantage of the embodiment of the present invention is that, by ordering the inputs as shown, and by 
using the balanced decimated method in accordance with the present invention, similar design structures can be used 
at several points in the silicon implementation. For example, as shown in FIG. 14. the constant coefficient multipliers 
MULC1S MULC3S3. MULC3S2 and MULNC1S all have similar structures and receive data at the same point in the 
data path so that all four multipliers can be working at the same time. This eliminates "bottlenecks" and the semicon- 
ductor implementation is. therefore, able to take full advantage of the duplicative, parallel structure. The carry-save 
adders BT2 and BT3 similarly will be able to work simultaneously, as will the following carry-save adders and subtrac- 
ters. This symmetry of design and efficient simultaneous utilization of several devices is common throughout the struc- 
ture according to the embodiment of the present invention. 

FIG 15 shows the preferred arrangement of the post-common block POSTC 233 in accordance with the present 
invention As FIG 10 shows, the primary functions of the post-common POSTC 233 are to form the hO to h3 values by 
multiplying the outputs of the common block by the coefficients d1. d3. d5 and d7; to add the g(k) and h(k) values to 
form the low order outputs; and to subtract the h(k) values from the corresponding g(k) values to form the high-order 
outputs Referring now to both FIG. 10 and FIG. 1 5. the post-common block POSTC 233 latches the corresponding out- 
puts from the common block CBLK 232 into latches BHOL. BH1L. BH3L and BH2L when the Bh latches are enabled, 
the control circuitry sets the EN_BH signal high, and the output clock signal OUTCCLK signal goes high. The g(k). gO 
to g3 values are latched into corresponding latches GOL. G1L. G3L and G2L when the control circuitry enables these 
latches via the signal EN_GH and input clock signal IN_CLK goes high. 

The processed odd-numbered inputs, that is. the values hO to h3. are latched into latches HOL. H1 L. H3L and H2L 
when the EN_GH and IN.CLK signals are high, via the constant coefficient multipliers D1MUL, D3MUL. D5MUL and 
D7MUL These multipliers multiply, respectively by d1, d3. d5 and d7. In the preferred embodiment, these constant-, 
coefficient multipliers are preferably carry-save multipliers in order to simplify the design and to increase calculation 
speed As FIG 15 illustrates the "carry" ("c") outputs from the constant coefficient multipliers are connected, with cer- 
tain changes described below, to the a inputs of resolving adders HOA. H1 A, H3A and H2A. The "save" ("s") outputs 
from the coefficient multipliers are similarly, with certain forced changes described below, connected to other input of. 
the corresponding resolving adder. 

As FIG 15 further illustrates, the LSB of the HO signal is preferably forced to be a "1" by tying the corresponding 
"save" output tor HO is set to 0 (tied to ground GND), and the second bit (corresponding to H0S[1]) is set to "1". The 
data words from the carry and save outputs of the constant-coefficient multiplier D3MUL are similarly manipulated an . 
input to the resolving adder H1 A. The advantage of these manipulations and their input to the resolving adder H1 A. 

In accordance with the present invention, all 22-bits of the carry output from the coefficient multipliers D7MUL and 
D5MUL are connected directly to the "a" input of corresponding resolving adders H3A and H2 A. The MSB of each mul- 
tiplier's "save" output, however, is forced to "0" by tying the corresponding data line to ground GND. 

The IDCT system described was tested against the CCITT specification described above. Because of the scaling 
and other well-known properties of digital adders and multipliers, some precision is typically lost in the 10.000 sample, 
but run that forcing the various bits described above to either "0" or "1" reduced the expected error of the digital trans- 
formation As a result of the bit manipulation of the data words, the embodiment of the present invention achieved 
acceptable accuracy under the CCITT standard using only 22-bit wide data words, whereas 24 bits would normally be 

required to produce equivalent accuracy. 

Because of limited precision, and truncation and rounding errors, there is typically some inaccuracy in every data 
word in an IDCT system. However, forcing selected bits of a data word it was discovered that the error thereby system- 
atically introduced into a particular data word at a particular point in the hardware yielded statistically better overall 
results. Bit-forcing may also be applied "within" a multiplication, for example, by selectively forcing one or more carry 

bits to predetermined values. 

In the present invention, the bit-forcing scheme need not be static, with certain bits always forced to take specified 
values, but rather a dynamic scheme may also be used. For example, selected bits of a data word may be forced to "1" 
or "0" depending on whether the word (or even some other data) is even or odd. positive or negative, or above or below 

a predetermined threshold, and the like. 

Normally, only small systematic changes will be needed to improve overall statistical performance. Consequently, 
according to this embodiment of the present invention, the LSB's of selected data words (preferably one bit and one 
data word at a time, although this is not necessary) are forced to be a "1" or a "0". The CCITT test is run. and the CCITT 
statistics for the run are compiled. The bit is then forced to the other of "1" or "0". and the test is rerun. Then the LSB 
(or LSBs) of other data woids are forced to "1" or "0". and similar statistics are compiled. By examining the statistics for 
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various ^nations of forced bits in various forced words, a best stafstica. performance can be determined 
DIMU^^ - a «** <™ the constant-coefficient mu.tip,iers 

The lower 21 -bits of the input of ^c^S^SE - ^Tw^^ H0A-H3A. 

The outputs from the H-latches (H0L-H3U and Ihe frhtehSml Si w ,! bed t0 9r ° und - 
inputs to resolving adder-subtractors S70A S61AS43A an5 ?^a ( ? ^ J? * form ,he res P ectiv e a and b 
inputs when the ADD signal is high and suWrS tte ^ .*! Was ,nd,cated devise add their 

is high. The second bits of th B .^^r"r::"! " r0m ne a W wnen subtraction enable signal SUB 
ments in a manner described taET U ' L ^ ° 1 L 3fe mani P ulated * multiplexing arrange- 

The outputs from the resolving adder-subtractors S7f»A cria q>ha »^ o M , 
R70L. R61L. R43L. R52L. suoiractors &70A S61A. S43A and S52A are latched into result latched 

As depicted in FIG 15b, the input words to the adder/subtrartnr <?7nii nr,ri weci a 
invention, have the second bits of each inrJworrt m»Z T2 £ S61Al ,n accordar, ce with the present 

■a'-input of the adder S^l^S^J^^ , F ° r f amp,e - ,he second bit of the *** word to the 
- G01 M. The second «. SSSS^SSSSffi SESXE a^"? * T "** 
manipulation is accomplished by four 2:1-bit mmJ^SSSS^^^!^^ "amputated. This bit 

The four high or low-order results are latched in the output latches R70 1 rah d^qi * ^ 

conned Sim. y b^g ^1:^* Eg"** " «** « ^ ° a 
13b S it P ^ ^ °° nW S, ' 9na,S in the ^-common block POSTC 233 is shown in F.GS. 

' »£S£^f^^ IT™ iT be S6ries ' with an Evening 

therefore, according to SStSSSSi of the 2£rE SETS £l ^ T™™" block P °STC 233. are 
wise) in a conventional storage SnTSch as a RAM 1™ f / 3 kn ° Wn manner "'"umn-wise (or row- 

unit row-wise (column wiSTso as o be - . * V (n0t Sh ° Wn)l and are then read from the 

■ described aboSs ^ "lock a <* ** Processing as 

o* • i , . amnion diock odLi\ ^32, and a post-common Work po^tp oqq 

tiplications performed in the 1 -D IDCT operation^ necessary to offset the two square root of two mm- 

Prefera^Ts^ «en«ca, to that shown FIG 11) is 

circuits were used for both transforms ^ESh—JT? n? ? decrease ,n tha t wouW arise if the same 

^A^ 

--^ilSl^i^SP e^iment have been changed to ^st-resoMng-adders". As 

each datapath ^Z£^^K2?2 %^^T£rT* ? ^ ^ ** 

been changed to 2-phase Hip-flops" or -register- ,heeX ' S,,n9 ,atches ^ f.rst preferred embodiment have 

comb?n e edt,^ 

whicl hasten" Sgh the^D loCT T 2™ Un0Pera,ed Upon da,a > «* *«" T2" (data 

Pipeline in a time !SS£i " ** TRAM) ' " intr0duced into the 
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In the present invention, each stream takes its turn to introduce a group of data items into the datapath pipeline. 
The data streams are "interleaved" as they pass sequentially down the datapath pipeline and are "de-interleaved" at the 
datapath output, as shown in FIGs. 1 7. 1 8 and 33. A group can vary in number, but in this example, they are eight b.ts. 

In accordance with the present invention. T1 must not be stalled. If T2 arrives at the point of interleaving with T1. 
but the input buffer should not introduce its data into the pipeline because this would clash with the T1 stream, then 
stream T2 provides an extra buffering so that T2 does not stall the data stream, but instead will buffer up data from its 
input stream until such a time as it may safely interleave with stream T1 . This is shown in FIGs. 19 and 33 where the 
data from stream T1 is being loaded into the first transform in latches 0-7. using signals. "Latch 1(0) 'through Latch 
1 (7)". Additionally, data from T2 is being loaded in "Latch 2(0) 'through' Latch 2(15)". as shown in FIG. 1 9. using signals 

shown in FIG. 33. t . . 

The interleaving is controlled by "T1 OK2 insert" and "T2 OK2 insert" signals. Under normal operation, the inter- 
leaving will occur when the signals go high. However, if the appropriate amount of data in the latch for T2 has not yet 
been reached when "T2 OK insert" goes high, then the latch will miss its opportunity and must continue buffering data 

until the next opportunity to insert data occurs. 

In summary if the above described buffering, in accordance with the present invention, is to occur, comparable 
"slippage" has to occur at the output of T2. T2 slips when it misses its data insertion point and has to continue buffering 
in the latches shown in FIG. 19. If T2 slipped and did not introduce data into the pipeline there will be a corresponding 
aao in the T2 stream output at the datapath output. This gap may be removed or "swallowed up" by use of the extra 
buffering at the T2 output. This process may be thought of as having a "fixed" T1 - 1D IDCT transform with a vanable 
T2 - 1 D IDCT, where the data streams are interleaved in a time multiplex fashion such that they may use the same piece 

of arithmetic datapath pipeline. 

In the present invention, "Recovery" takes place when non<iata enters T1 . It is an opportunity for the T2 buffer to 
catch up to T1 and the datastream. Non-data is a data type that bypasses the IDCT and is shown as a data spike in 
"Latch 2 [ 6 r of FIG 34 This eventually makes its way to T2 input, which allows the T2 buffering to fill up at the output. 
Recovery is shown in FIG. 33 and FIG. 25 when the T2 dout" signal and the "out" signal are gapped by a number of 
cycles The gap is used as a reference to fix the data stream. It should be noted that the gap in cycles between these 
two signals is the same as the gap of buffering when the latch for T2 was waiting to insert its data. 

Following the TRANSFORM in POSTC 233 part B, the interleaved stream is de-interleaved into "T2 out", as shown 
in FIGs 18 and 23 The "T2 out" data stream has slip gaps in the data as described above. The T2 out [1 43: *]. shown 
in FIG 17 enters a 16 to 1 multiplexor block, shown as block "IDDPMUX" in FIG. 17. This multiplexor block will select 
data from one of 16 positions in the output buffer block, as shown in FIG. 25. This position is selected by the control 
logic, shown in FIG. 29. which uses the gap by which T2 "buffered-up" at its input. This gap is used as a reference. The 
output stream. T2DOUT, from the multiplexer block is the "fixed" data stream. 

In range tests carried out on an embodiment of the present invention for the IDCT arrangement described above, 
it was found that all intermediate and final values were kept well within a known range at each point while still meeting 
the CCITT standards Because of this, it was possible to "adjust" selected values as described above by small amounts 
(for example, by forcing certain bits of selected data words to desired values) without any fear of overflow or underflow 

in the arithmetic calculations. 

The method and system, in accordance with the present invention, can be varied in numerous ways. For example, 
the structures used to resolve additions or multiplications may be altered using any known technology. Thus, it is pos- 
sible to use resolving adders of subtractors where the preferred embodiment uses carry-save devices with separate 
resolving adders. Also, the preferred embodiment of the present invention uses down-scaling at various points to 
ensure that all values remain within their acceptable ranges. Down-scaling is not necessary, however, because other 
precautions may be taken to avoid overflow or underflow. 

In one embodiment of the present invention, certain bits of various data words were manipulated to reduce the 
required word width within the system. However, the various intermediate values may. of course, be passed without bit 
manipulation. Furthermore, although only data words were bit-manipulated in the illustrated example of the present 
invention it is also possible to manipulate the bits of constant coefficients as well and evaluate the results under the 
CCITT standard. If a comparison of the results showed that it would be advantageous to force a particular bit to a given 
value in some cases, on might then be able to increase the number of "zeros" in the binary representation of these 
coefficients in order to decrease further the silicon area required to implement the corresponding multiplier. Once again, 

bit manipulation is not necessary. 

In summary of the above aspects of the present invention, the following is disclosed: an apparatus for transforming 
data having a first latch defining a first data stream source and a second latch defining a second data stream source. 
The first and second latches are in communication with a single arithmetic unit. The arithmetic unit communicates data 
to a transpose RAM, the transpose RAM transposes the data and communicates it to the second latch. The second 
latch is adjustable and can be varied in size to accommodate variable rates of data being received and transmitted. The 
second latch and first latch communicate 1st and 2nd data stream to the arithmetic unit sequentially, however, the 
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sequential communication of the second latch does not interrupt the communication from the first latch In this mannpr 
Detailed Description of Invention for Time Synchronization 

r 0 ' * ^r EG " 2 ' video and audio da,a * synchronized using information earned in the MPEG-2 systems stream In thte 

' 3SS ? ere ^ eSSenta " y t " 0 ,yP6S 01 informa,ion ■*« deaI ^ synch-onization; iSSbXZSSZ^Z 
Clock references are used to inform the decoder what number is used to represen the time W tST fc 

T-me^r " 6n,ed 9t rC9Ular in,erVa,S S ° ,h3t 1,16 ^Z^y^T^Ze c^eS is 

n MPE <* multiplexed inlo !he sjrefem etrearn ere a series of dock references. These olook reference. doI in. ,h. 
references ere ^h'!^^ "T 2*? b «~" PCRs and is « "*«"« 

When decoding MPEG, discontinuities in the concept of "system time" may occur For instance in an edited hit 

wnn respect to a system time defined by a clock reference from another regime will clearly lead to incorrect r e <..L 
Figure 39 shows the demultiplexing of the MPEG systems stream into 7lemenarv E?rh 1m J^!' 

stream wH. typically carries either video or audio data alSough. in genera? mS^SS!!^££S!!^ 

tt^^S^SJZST - ~ ° f Vid6 ° - a ~ - - 

tern 2° multip,exed in, ° the *■»"■ stream « a series of clock references. These clock references define the "sys- 

Th. aec « dance wilh , the P resen « in vention. associated with each elementary stream is a series of time stamos 251 
presented. These time stamps are referred to as presentation time stamps, "PTS" 

<*JH!Z££- * Vl ? e ° dS,a ' 8 SeC ° nd type ° f time 5tamp is a,so def ined is re 'erred to as a decode tone stamp "DTS" 
SZSr^iT ""^T WhCn 9 PTS iS al5 ° PreSent and there is a sim P |e relationship tSmmS^tlSZi 
Ze^entn *~ ^ ^ * ,imeStamPS be ^ 5ince PTS/DTS d «— " a - S !2S2 

h^J!,^ 6 time 8,3,11,35 (DTS) define the ,ime at which a " a «=ess unit (picture in the case of video) is to ba 
£22" .^ e « PreSe ^!!T 5,3,11135 (PTS) dS,ine tme at *** an a «=ess unit is to be^sent^SlyeS) 

p^s^ 

In accordance with the present invention, it will be appreciated that to properly synchronize time, it is necessary to 
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be consistent in the use ol time stamps. In one preferred embodiment, the time synchronizing circuitry is placed at a 
^^^^Gn. when toe pictures occur in their decoded order. Accordingly, this embod-ment uses the 

DTS 

Neverthe.ess. the circuitry could equally be moved to a point in the decoding pipeline 
are reordered and therefore, the pictures would reach the synchronizing circuitry in their display order. Hence, as will 
he aoDreciated bv one of ordinary skill in the art. PTS would be used in this embodiment. 

'SC^M of the present invention, the information derived from the timestamps is transported 
through the various circuits by means of tokens. Tokens consist of a series of one or more words of information Jhe 
irS word of Z token contains a code which identifies the type of token and. hence, the type of information earned by 
ha token Asidated with each word of the token is an extension bit which is set to one to indicate that there are more 
worts i i, ^cu?rent token. Therefore, the last word of a token is indicated by the extension bit being zero. In the present 
^SSS^SSZ first word indicating the type of token may be of a variable number of bits so that some codes 
Z! a small number of bits (allowing the remainder of the bits in the f irst word to be used to represent other information) 

tem decider and the video decoder, there are two types of information: (1) the coded video data and (2) toe synchro- 
nizationTme derived from the time stamp information. The coded video data is viewed as data and .s earned ,n a DATA 
token (e g the token called DATA) while the synchronization time is viewed as control informat.on and ,s earned in a 
coTo take T (called SYNC TIME). Additional control tokens may also be used from time to time in the presen ,nven- 
«on FcT Sample, a FLUSH token that behaves in a manner similar to a reset signal may be requ.red to m.t.al.ze the 
video decodina circuitry before attempting to restart decoding because of an error. 

* a^coidTce withthe present invention, it is an object of one preferred embodiment to time synchrony two ar- 
cuits and more particularly, to time synchronize two circuits without directly communicating system time from the first 
to the sto^ ciSiit. .n accordance with the invention, time synchronization of two circuits is accomplished without 
passing system time directly to the second circuit by providing synchronized time counters in each circuit. 

The present invention also enables the system to time synchronize two circuits without communicating system time 
from the first to the second circuit by providing an elementary stream time counter in each circuit. 

Accordingly, another object of the present invention is to time synchronize two circuits and to determine the pres- 
entation time error, if any. of the object being presented by using time stamp information, system time, and etomentary 
stTSm time from the first circuit to generate synchronization time passed to the second circuit and compared to a copy . 
o efomemary stream time in the second circuit which is synchronized with the elementary stream time in the first circuit. 
The system of me^eU invention can time synchronize a system decoder and a video decoder without directly com- 
mun&Csystem time from the system decoder to the video decoder, without passing system time directly to the video , 
decoder by providing synchronized time counters in each circuit and without communicating system time from the sys- , 
tem decoder to the video decoder by providing a video counter in each circuit. 

?he^vemion also enables the system to time synchronize a system decoder and a video decoder and to deter- 
mine the display time error, if any. of the picture being displayed by using video time stamp information system time 
a^d Jdeo ScSing time from the system decoder to generate synchronization time which ,s then passed to he video 
decoder and compared to a copy of video decoding time in the video decoder which is synchronized with the video 

^Tacc^ intormatfon derived from the timestamps can be transported through the 

svstem usina a control token as previously described. . 

Figure 40 shows a first preferred embodiment implementing elementary stream timestamp management. ,n 
accordance with the present invention. The clock references 253. which represent system time, are decoded bythe sys- 
? e rdemu"ptxer 2 £ and placed initially, and then as needed, into a time counter 255 within the system *«*rttl 
and are incremented at 90 kHz. A second copy of the clock reference 253 is simultaneously loaded ^ 
255 that is inside the elementary stream decoder 257. incremented also at 90 kHz. and synchronized to the time coun- 
ter 255 in the system decoder 256. . . 

The time stamps 251. in accordance with the present invention, flow from the system demux 254 through the ele- 
mentary stream buffer 260 so that they are delayed by the same amount as the incoming data. The time stamps 251 
may also have a correction added to compensate tor the non-zero decode time of the elementary stream decoder 257. 
The corrected time stamps 251 are then compared with the copy of the time stored the time counter 258 inside the ele- 
mentary stream decoder 257 to determine whether the decoded information is presented too early or too late. 

The above embodiment is better than merely passing system time directly to the elementary stream decoder 257 
from the time counter 255 in the system decoder 256 because the counter in the system decoder changes 90.000 times 
a second. Therefore, system time would, in all essence, need to be continually passed to the elementary stream 
decoder 257 Passing system time continually would require dedicated pins or the like. By using a t.me counter 255 
located in the system decoder 256 and a time counter 256 located in the elementary stream decoder 252. system time 



35 



EP 0 891 094 A2 

can be passed in the form of clock references 253 a few times a second 

isis^zassr in h Fi9 r 1 avoids the ^ ,w - ** ■* 

262. containing information on e.emen ^ 
■< elementary stream decoder 257. The two es time cWereS and^2M ^JL , ^ deCOder 256and 1116 
as channel change and then thev free run from^o™ ^ c ,t ^ 6S6t at P™ 6 ' on> and at 0,her times such 
262 and 263 ^TJ^^^^SS^TJT^ ^J™* 

do not get out of s?ep One Z to eS JS?1 HIT £° ^ m6aSUreS 10 enSure ,he es - time «"*•» 

esjime counter in £ t TlZS^ ^ ?* h S,eP * t0 USe ^ 0ut of the 
o in Figure 41 J me COun,er ,n the elem entary stream decoder 257 as shown 

puted using the elementary streanTtimeS™ th» Accordingly, a synchronization value X is com- 

Equations 3-1: 

a) Time Synchronization = (Elementary stream timestamp - system time) 

b) Time Synchronization = (X - elementary stream time) 

c) (X - elementary stream time) = (elementary stream timestamp - system time) 

d) X = (elementary stream timestamp - system time + elementary stream time) 

z^d^^SSZ^ XSSEEStt "** **! 3 COrreCli ° n 3dded to COm <> ensate *>' *• non- 
elementary stream ime Slffi? toe e fS w^^St^T^ *"* * then with the 

counter es timV?M .* , ' a ' Ue ° f Sys,em t,me at me instant tna * »e elementary stream time 



Equations 3-2: 



a) elementary stream time = system time - initiaUime 

b) X « (elementary stream timestamp . system time + [system time - initiaLtime]) 
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c) X = (elementary stream timestamp - initial_time) 

Two solutions for deriving the synchronization time. X, in accordance with the present invention have been illus- 
trated However it will be apparent to those skilled in the art that there are many other equivalent solutions. 

Fioure 43 shows another embodiment of the present invention implementing video timestamp management. The 
clock references 253. which represent system time, are decoded by the system demultiplexer 254 and placed initially, 
and then as needed, into a time counter 255 within the system decoder 256 and are incremented at 90 kHz. A second 
copy of the clock references 253 are simultaneously loaded into the time counter 258 that is inside the video decoder 
270 and incremented at 90 kHz. and synchronized to the time counter 255 in the system decoder 256. 

The video time stamps flow from the system demux 254 through the video decoding buffer 271 so that they are 
delaved by the same amount as the incoming video data. The video time stamps may have a correction added to com- 
oensate for the non zero decode time of the video decoder 270. The corrected video time stamps are than compared 
with the copy of the time in the time counter 258 inside the video decoder 270 to determine whether the decoded picture 

is displayed too early or too late. , 

The embodiment shown in Figure 43 is an improvement over the process of merely passing system time directly to 
the video decode, hom the tome counter in the system decoder because the counter in the system decoder changes 
90 000 times a secorxj Therefore, system time would in all essence need to be continually passed to the video 
decoder Passing system time continually would require dedicated pins or the like. By using a time counter located in 
the system decoder and a tome counter located in the video decoder system time can be passed in the form of clock 

references a few times a second . _ . 

Referring now to F.gure 44. the clock references, which represent system time, are decoded by the system demul- 
tiolexer 254 and placed .nto a time counter 255 within the system decoder 256 and incremented at 90 kHz. The v.djime 
counter 272 in the system decoder 256 and the vidjime counter 273 in the video decoder 270 are synchronized with 
each other and incremented at 90 kHz Video time stamps are also decoded by the system demultiplexer 254. Accord- 
25 ingly a synchronization value X is computed using a video timestamp. the system time contained in the time counter 
273 and the video decoding time contained in the vidjime counter 272 contained in the system decoder 256 accord.ng 

t0 tH Th?Slcwing set of equations 3-3 (a - d) is illustrative of one method in accordance with the present invention, for 
time synchronization which avoids passing the clock reference 253 to the video decoder 270. Equation 3-3(a) is the 

30 equation required for time synchronization. Since it is undesirable to pass system time directly to the video decoder cir- 
cuit 270 as shown in Figure 44. a synchronization time representation X is generated, using Equation 3-3 (b - d). by the 
svstem decoder 256 and passed to the video decoder 270. Synchronization time. X, is then compared to the video 
decoding time contained within the vidjime counter 273 located within the video decoder 270. The compared result is 
used to determine whether the decoded picture is displayed too early or too late and then further used in time synchro- 

35 nizing the system. 

Equations 3-3. 

a) Time Synchronization = (Video timestamp - system time) 
40 b) Time Synchronization = (X - video decoding time) 

c) (X - video decoding time) = (video timestamp - system time) 

d) X = (video timestamp - system time + video decoding time) 

♦ 

In the present invention, the synchronization time. X. may have a correction added to compensate for the non- 
« zero decode time of the video decoder. The corrected synchronization time is then compared with the video decoding 
time contained in the vidjime counter 273 located inside the video decoder 270 to determine whether the decoded pic- 
ture is displayed too early or too late and is also used to time synchronize the system. Note, the time correction can be 
subtracted from the video decoding time contained in the vidjime counter 273 located inside the video decoder 270 
instead of added to synchronization time X for the same result. The above embodiment of the present invention is 
so another example of a solution for generating synchronization time X and determining whether the picture is displayed 
early or late. However, it will be apparent to those skilled in the art that there are many other equivalent solutions for 
accomplishing the above. 

Another nice feature, in accordance with the present invention, is that there is no need to deal with the full 33 bit 
time stamp number or 42 bit clock reference number. The present invention restricts the counters to 16 bits to allow 16 
55 bit handling on the video decoder 270. At first glance, it would appear that 1 6 bits cannot represent a sufficient number 
range at a resolution of 90 kHz (only 2/3 second to be used). However, there is no need for such high precision because 
the time control on the video decoder 270 is only accurate to a field time (since the video timing generator VTG free- 
runs or is gen-locked to something that has nothing to do with the MPEG stream being decoded) and. therefore, it is not 
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related to timestamps or presentation time in any way 

decree a™ te F t: fis^sKKwr - co r 273 wHhin ,he ^ 

allows timing ca.culations with an accuraci jof aSSS " 93 84 ^ 16 bite 

through 'IT ^ inV6nti0n - ° f the m0vin « the stamp 
appreciated that alternative SSSiESSL ta222f!2 i ^ iS Tokens - but » wi » be 

ule is added just after the ScSe^eteS ^T?5 JTf ' S diVided in, ° two modules The first mod- 

containing the synchroniSSn time X ^s ^LJZZTV* reS P° nsib,e tor 9 ene ^9 a token. SYNC TIME 
PICTURE START token. In the MPEG systems , ^ "? ^ ^ jUSl bef ° re an 

first picture in the packet of data Since 21232 * ^J? ^f^?* 5 ^ in a f**« header and refers to the 
of the previous picture before £ SEESE 355^2^^ ^ * * ^ ^ ^ ^ 

or^ 

(d,v,ded ,nto two parts to allow access to each byte individually), is furS deSSTn " 6 ^ 



Register 



tsjow 



Size/Di 



8/rw 



Reset 



Description 



tsjiigh 



8/rw 



The lower eight bits of the synchronization 
time value. 

The tsjow register is slaved so that 
new values may be written into this register 
without affecting the value previously written 
(that will become part of a SYNCJTIME to- 
ken). 

Writes to tsjow register affect the 
master register whilst reads read-back the 
slave register. Until a master -to-slave trans- 
fer has been effected using ts_valid the value 
written into tsjow can not be read back. 

The upper eight bits of the synchronization 
time value. 

Slaved in the same way as tsjow. 
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Register 



Size/Di 



Reset 



ts valid 



l/rw 



ts-waiting 



l/ro 



Description 



This bit controls the master-slave transfer of 
tsjow and ts Jiigh. 

When values have been written into 
ts Jow and ts Jiigh the microprocessor 
should write the value one into this bit. It 
should then poll the bit unit it reads back the 
value one. At this point the values written into 
tsjow and ts Jiigh will have been transferred 
into the slave registers (and can be read back) 
and ts-waiting will be set to one. 

The microprocessor should then write 
the value zero in preparation for the next ac- 
cess. 

When set to zero the registers tsjow and 
ts_high do not contain valid synchronization 
time information. 

When set to one the registers tsjow 
and tsjiigh contain valid synchronization time 
information. A SYNC_TIME token will be gen- 
erated before the next P I CTU R E_ST ART to- 
ken and ts_waiting will then become zero. 

This bit should be polled to ensure 
that it is zero before writing a one into ts_valid 
to ensure that the previous synchronization 
time value has been used before it is overwrit- 
ten by the master-to-slave transfer. 



Table 12 Microprocessor registers for handling synchronization time 



In the present invention, a flag, ts-waiting. is set to indicate the fact that valid synchronization time information is in the 
timestamp register. If the data was supplied using the SYNCTIME token, then that token is removed from the stream 

of tokens. . . 

When a PICTURE START token is encountered, the flag that indicates the status of the synchronization time reg- 
ister is examined. If the flag is not set, then no action is taken and the P I CTU R E_STA RT token and all subsequent data 
is unaffected. If. however, the flag is set. indicating that valid synchronization time information is available in the register, 
then a SYNC.TIME token is generated and placed in the data stream before the P I CTU R E_STA RT token. The flag is 
then cleared and the synchronization time register is made available for the next time-stamp that occurs. 

The second module as shown in Figure 46, consists of a prescaler clocked at 27 MHz and a vidjime counter 
clocked by the prescaler 278 which are associated with the microprogrammable state machine, (MSM) 218. 
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nooawni* T" """" °' me P,esenl inven,i0 " «" "^*™n»d with Mv clocked feed-beck (fc, 

dec<*e, stays hCS^St^i^^SS."" ^ *" *" '"~ " " *" — 

~ttX£E&™J£t!^JSZS " SVN ^- T r E "~ ~ » 

video SYNP timp » «! 7 counter and to then compare it with the value supplied bv the 

ISiSir ,here,0r6 ' d6,ermine Whe,her * is «* » .ate. as compared ,o the fS^SSl 

^)^^!S^^s1^^^ C Z!T n I 3 * 601 f ° S ^ h --«°" «me X (discussed 
through the video and audio decoders synchronization l,me and. thereto™, compensate tor ditterential delays 

-^"SS^^^ 7- - «M «'. - ■» Present intone*, 

ken and the most reliable wav to correct the ~ 7 tsse ™a»y. the decoding of the sequence is bro- 

access or channel chan« ,1 LlrTl . 1 , f ^ ,he decodin 9 P rocess a manner similar to random- 

to dZ£ SZ " SJSJI^J^fSS^^ COntr01 r69i5ter ° f ,he MSM 218 may be ^mmed 
lc nrYf ^„ . a 5U,iaD,e start code or FLUSH token is encountered. In addition the error "ERR Ton pari v~ 

^r^'" 9 — because 1 :, s™, £2.™^ 
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Register Na me I Size/Dir 

ts correction I 16/rw 



frame_time 



1bvid time 



manual_startup 



Reset State 

zero 



16/rw 



226 or 188 



16/ro 



zero 



l/rw 



zero 



Description 

Correction added to synchroniza- 
tion time before it is used. 



Represents the tolerance on the 
timing of decoding pictures. Reset 
state determined by the PAUNTSC 



pin. 



Reset by either reset or 
reset_time. The current value of 
video decoding time. 



When set to one the start-up is to 
be performed manually using 
decode_disable. In this case 
SEQUENCE_END and FLUSH 
tokens at the MSM cause 
decode_disable to be set to one. 
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Register Name 

decode_disable 



Size/Dir 

1/rw 



Reset State 



zero 



disable_too_early I 1/rw 



zero 



NTSC_30 



1/rw 



zero 



discard_ifjate 



1/rw 



zero 



Descript ion 

When set to zero the decoding 
proceeds normally. 

At the start of each pic- 
ture the MSM checks the status of 
decode_disable and will not pro- 
ceed if it is set to one. 

Note that If manual start- 
up is to be performed (i.e. without 
the time-stamp management 
hardware) then this bit should be 
set to one at the same time as 
manual_sta rtup is set to one. 
When set to one the error 

"ERR_TOO_EARLr indi- 
cating that the decoding is too early 
is suppressed and the MSM simply 
waits to cor rect the situation. 
When set to one the prescaler di- 
vides by 4804.8 rather than 4800. 
Set automatically when decoding 
30 Hz frame rates. 

This has no effect unless an 
•ERRJTOCLLATE" is generated 
(or would be generated if errors 
were not masked out). If it is set to 
one then data is discarded until the 
condition indicated by 
discard until. 
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Register 
name 

discard_until 



Reset State 



zero 



Description 

Indicate the condition which causes 
time-stamp triggered discarding to be 
terminated. 

0 - FLUSH 

1 - SEOUENCE_START 

2 - GROUP_START 

3 - NEXT PICTURE 

Note 1 - that discarding one 
picture may immediately be un-done 
if that picture is a field picture by the 
generation of a dummy field to pre- 
serve the alternating top/bottom field 
structure. As a result if discard_until 
is set to "Next Picture" but the dummy 
field would be generated one further 
picture is discarded. 



Table 13 TImestamp MSM registers 



As a result of the synchronization time handling of the present invention, it is possible that one of two errors will be 

9en ERR i TOO EARLY is generated if the decoding is taking place earlier than the time indicated by the time-stamp^ 
ERR_TOO_EARLY may be suppressed, but ERR_TOO_LATE will always be generated unless all errors are masked 

0Ut In summary the present invention includes: an apparatus for synchronizing time having, a timestamp for determin- 
ing presentation time, a clock reference for initializing system time in a first circuit, a first time f^J^^T^^ 
with the clock reference for keeping system time in a first circuit and a second time counter initialled by the clock ref- 
erence in a second circuit synchronized with the first time counter, for keeping a local copy of the system time and for 
determining the presentation timing error between the local copy of system time and system time byj ^ om P« r,n 9^ 
timestamp to the second time counter. It further includes an apparatus for synchronizing a system decoder and a video 
decoder using a timestamp for determining display time, a clock reference for initializing system time in the system 
decoder a first time counter in communication with the clock reference for keeping system time in the system decoder 
and a second time counter initialized by the dock reference in the video decoder synchronized with the first time coun- 
ter for keeping a local copy of system time and for determining the display timing error between the local copy o system 
time and system time by comparing the timestamp to the second time counter A still another embodiment '"eludes an 
apparatus for synchronizing a first circuit and a second circuit using a clock reference for initializing system time in the 
first circuit a first circuit having a time counter in communication with the clock reference for keeping system time, a first 
elementary stream time counter in the first circuit for providing elementary stream time. The first circuit is adapted to 
receive a time stamp, and the first circuit generates synchronization time by adding elementary stream time to the time 
stamp and subtracting system time. The second circuit is adapted to receive synchronization time from the first circuit 
and has a second elementary stream time counter in synchronization with the first elementary stream time counter for 
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t^!^ ? «-™«-J a timing error between the system time and 

reference signal does^ K fS 22S °' etemen,arv ^ream «»r,e. In this way. the clock 

another embodiment, an appTatus for ^roSJu^J^f^ M l ° determine ,he timin 9 e ™ 

5 izing system time in the Scircuft circurt has a tirT^f M h3S 3 C,0< * re,erence *» ini « al " 

keeping system time, and a first vide^meco nt r fo proSng^'deSS^ ? refe ' ence * 
receive a video time stamp and generates synchronization Z» h ! Ihh de ° od "? tm * 1116 f,rst circuit is adapted to 
and subtracting system time The 3 S 1 * 9 Vlde0 deCOdin9 time to the video 'i™ stamp 
a second video time counterr sy SnSn s ' nchr ° ni2a «°" time from the first circuit and hS 

,o decoding time and fordeterminmg^ 'oca. copy of video 

nization time to the local copy of video dUxiina ^2™^^^ 

passed diredy to the second tircui, in *" 001 have t0 be 

for providing timing information by providing ^2, data siream hiTn ? a ' S ° inCludes a me,hod 

the time stamp refers to the first Picture in me rcktfS H»t- , 9 S,3mP Ca,ried in P 3 *** header wherein 
r 5 to indicate valid time stamp infoSon whichSln 8 re9 ' Ster ' S prOV,ded havin 9 a ,,a 9 

timestamp is removed frZ Ih^S^SiJ^Si? T" "* int0 *" regis,er - Ne *' *e 

start and subsequently examine J£JL « and f ,a f ed ,n ,he re 9 ,ster - "ext. the method encounters a picture 
the "^erby^^ information is contained in 

valid time stamp information is contained in the - SSnSTt^T^ P,CtUr3 St3r1 * me f ' ag indicates 

0 Another embodiment includes an m^oSS^^S tmestamp « .nserted back into the data stream, 
restricted to 16 bits Likewise there fc -Z ^„E?SS abo ^« wh u ere,n the elementary stream time counters are 

counter .ocated in the SemelT 5r amSer is SSS to TebtT'^ Stream ,ime 

described ^^•fctthe.^^J!2^ 1 lS^ 1 J h 2S te ' F ^ e "™«. there is.an apparatus as 
The present invention also has a process toJd^i L JL J* 8 ° r con,rol,,na tne elementary stream decode. 

1 value. It then parses vidS £ hSSKSSS fo ^eterm,ning display time errors against a threshold 
paring the time stamp S^to.SSS h" ^ ^ * COm " 
Next, it determines whether the comoared vale i^o™ C °T pared value ,0 determine an indicative of timing error, 
eters when a timing error * h , reshoW "J* * wrthin acceptable param- 
alternative embodiment includes an aooaratus a c . COn ? ared value ,s outs 'de acceptable parameters. An 

i is adapted to accept MPEG system stnJarS an^d^ n, System decoder and a video decoder. The system decoder 

The system decoder has Iffff time S ^SS^S^ time Th" "S ™? T ^ ^ the Stre3m 
and the video time stamo and ha* a e^TVv! . ^ 1 e " Tne v,deo de coder accepts the video data 
decoder alsThaT *2*3£L!^ synchronization with the first time counter. The video 

video data at a Jy^e^X^ a ^T^^ 7*° *? * 3 substan,ia,lv constant rate and outputting the 
video data also coi^^^^^,^^ J? v,deo decoder while decoding a picture from 9 the 
appropriate displaying T^Wt^SSS^ ^ ^ SeC °" d time COUnter to de,ermine tne 

by providing the'fii circuit a s^meTs^ ItSSp'S ^^2^ V"' ^ ^ 3 B8B0nd 
synchronization time <X> by using the etementiry^ < E ^' oWai ™n 9 

accordance with the equation- X-ET * T<4 <?v Z^.T um ^ , > : Ir, e time stamp (TS) and the system time (SY). in 
a synchronized time W to ^ second circuit and generating 

ance with the eouation m^^r!? t *™ error bv ^"9 synchronized time (X) and in accord- 
system time toThe second JkTS^^SS^L * Synchronized with second circuit without passing 
curt has the M^Zs Z^ll Z^ T J 6 ?"" 9 3 t,me 6,TOr between a first « rcu « and a second cir 
nization time (X) * - S C J2^^!^"f ^ TO and 3 ° initi3 ' ,ime(l ^ «****o ^chro- 
providing synchronizaZ time (xTfoTheTilSi t ^T* U ™ ^ ,n accordanc e with the equation X=TS-I . 
and obtaining a SSt^ST^ l^r^ZT^T^ ^ nchroni2ed -"«tary stream time (ET) 
first circuit can be time syniSS^TrS^ 2? accordance wrth «»e equation ET-X. In this way. the 
another method for deter^n^^^ P 355 '" 9 System time to the second circuit Still 

viding the first circuit with a syl J tTme (^) ?So Sme sS,™ ™ T ^ inC,Ud6S 4,16 fo,,owin 9 ^ps: pro- 
chronization time (X) by using the ^SdKb^^ J»<^- ««**b syn- 

accordance wrth the equation X=vT+VTS-SY ™™ J;™1LT* v,ae °. t,rne s,am P (vTS) and the system time (SY). in 

Detailed Description of the Invention for Asynchronous Swing Buffering 

For asynchronous swing buffering, in accordance with the present invention, two buffers are operated asynchronous.^ 
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one is written while the other is read. Accordingly, this allows for a data stream having a first rate of hrough-put to be 
SSSSS^ a^r^ while still maintaining a desired rate. In the invention, the write contro and read contro 
SrSrSSe inXators for communicating which buffer they are using and whether the controls are waiting for 
£1 or ateln S acceSng that buffer, iach side communicates to the other sfcie a single bit to ind.cate wh.ch 
\Z usfng TnS'is the only signal that must be synchronized between the two sides of asynchronous circuitry. 
When one control circuit (read or write) finishes accessing a buffer, then the invention will allow control to pass to 
the Z^cSt Tafte° S control has swung, and two control circuits are trying to use the same buffer, then the later 
!^^b£«£Thi control circuit will wait until each side is using alternate buffers, i.e.. the other side 
ZE^^^tZZh * finds that it is now using the alternate buffer to the other side, it w* not wart but 
^mmSely commence accessing. This system of arbitration between the buffers is started up by both buffers using 
to^el^buZo. in this case. The read side starts up by waiting, while the write side ,s accessmg. s,nce there 

" " TZ^^Ta^lTL the present invention, the swing buffers are two discrete RAMS having a.. 

sutf TaTeraWing strobes addresses and data multiplexed from either the read or write s.de. dependent on 
ZStSZ is ^e^aSsfed bTeach side. This structure has been shown to use a ,ot of area in the busing of a large 

nU ^^S~- saves much of the busing area while st, 

to the same standard This structure contains twice as many rows of cells as one of the discrete RAMs found .n the first 
enSodTmTnl £E present invention. However, the second embodiment must have two pairs o brt lines s.nce the read 
II to the disaete buffers is happening simultaneously and asynchronously. Each row w,ll be of ,ts original width 
« e have" ^same number of cells) since accesses are the same width as for the discrete RAMS. Each pa,r o rows 
are' accused asrtat the same address, but from different buffers, so they connect to a different pair of brthnes. Using 
fhe Smelddress tL pair of rows can be readily accessed by one row decoder connected to the read address and 
one™ decoder connectS to the write address. Again, the read and write control never access the same buffer at the 
„ me t i m e so there is no conflict as to which pair is accessed by which row decoder. 

Tt^e SrTe way in which each row decoder can access rows from each buffer, both the read and write circuitry, 
within t^e iruciro the present invention connect to each pair of bitlines. one pair from each buffer. The read and 
wrrtes are thenmu^iplexed into each of the buffers and. for the same reasons explained above, there w,ll not be conf Let 
^s^how^inRgure 48, a swing unit 1 includes swing buffers 1 0 with RAM 12 and 14 in accordance w,th the present 
inventor , The S unit 1 also indudes a write control circuit and a read control circuit, which control the data unto and 
^ of the RAM^and 4. The read contro. circuit and the write control circuit accomplish this by use of strobes data 
»nH arid?e2 Toontroinnes 8 Lines 7 and 9 are control lines to indicate the RAM used by the write control circuit and the 
RAwfused^y^n Line 7 functions to contro. the write contro. circuitry, i.e.. when the read cor** 

cir^itry? using RAM ?2 if .ow. RAM 14 if high. Simi.ar.y. Line 9 functions to inform the read contro. crcurtry that the 

write control circuitry is using RAM 12 if low. RAM 14 if high. 

^n Te o esent invention swing buffer 10 has two RAM arrays. 12 and 14. Swing Buffer 10 is capable of asynchro- 
nous ISern^^ wring to the RAM area which enables the apparatus to achieve the necessary band 
S to 5 high speS accessing of the memory. The RAMs 12 and 14 require the following signals: wnte address 16. 
Tead address 18 data in 20. data out 22; and a read and write enable signal (not shown). See also Figure 49 

Th^wme address and read address signals are multiplexed by multiplexers 24. The RAM array 12 and 1 4 operate 
with the write circuitry, row decoder and read circuitry in a conventional sense. . 

In the first embodiment of the present invention, during initiation of the swing buffer 10. RAM 12 will be written to 
■ mtii the control circuitrv switches a write enable single to RAM 14. 

Once th^ ^RAlTar^2 has been written, ft falls under the control of the read circuitry 4. to be read. During , this 
time the RAM aVray 14 is also being written, ft is important to note when the RAM array falls under the control of the 
Sa^ray contro. 2%r the write coniro. circuit 4. the contro. is established until reading or wrrting is complete and then 
coTtro is turned over. In the situation where the read control circuit 4 is accessing the RAM array such as 12 and the 
wTte control circuitry 2 needs to access the same RAM array 12. then the write control orcuit will begin warting. 

tEES^ in accordance with the present invention, two control events are created. When a ^ * 
a read contro. circuit swings to a different RAM. it will either begin immediately accessmg the RAM ^e the RAM « 
freeand not under control of the alternative circuit, or it will begin to wait. During start up. the read side defers to the 
write side, since there is nothing valid to be read out of either buffer. ^K, rf ,„, nin ,i,^ B c 9 

The second embodiment of the present invention is shown in Figure 50. An mtegrated sw.ng buffer 30 includes a 
RAM array 32 having the logical size of RAM array 12 combined with RAM array 14. In other words, there .s the same 
; ^^SS^t^Mt and second embodiments, however, ft is combined in the second ernbodiment Accord- 
fn7v the irtegrated swing buffer has the advantage of saving much of the busing area while still performing the same 

W 'Tn^ r etSembodiment of the present invention, the write circuit and read circuit 34 and 36 respectively, are sim- 
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access the <o»s 44 in pains. A pS S raS hSa 21?" ' feC ° < * !, 38 and *"» deecder 40 

butfe. Una 52 and write ^er linTs? e^toel """"^ "* *' a ** ess lines 16 and 18. The read 

0 ticines 48 end bZ Vbini M o .££ *'T Se,eC, " ,!! dne c4 the pei,ed rows 42. Tiie 

Fo, clarity in depict™ the adSoT S °f fe 10 11,6 ,ead a,M "*»*» 34 end 36. 

coe™nica«on «, the ^^,3^X21™ ST " T "T 1 * CU " " 
cuit and read control circuit are in communication , «,nh .u . ., ^ Furthermor e. the write control cir- 

There is also a swing buffe E^STEZE^ * ^ ' Synchronized «** of the RAM arrays, 

through a pair of Sm*a reS contofriS 8 "** " Wr " e C ° n,r0 ' drCUit in c °™™nica1ion with the RAM array 

and a'read'ow d^an, ^ Z^Z^^ZT^ ^ ** °' brt ,in ~ 

are read. The present invention also o^Wes a mSh^ ^ ^ """^ S pair ° f rows 80 that indivi °""al cells 
of cetls in the rV using EE EEXSEEES SEESSXSES? ^ ft* 00 * 8 * ^ 9 
us,g at .east two pairs of bitlines connected to rea d ^^Z^^l~^^ 

DETAILED DESCRIPTION OF THE INVENTION FOR STORING VIDEO INFORMATION 

™tpa^^^ and display picture information. The three 

present invention C ' S ^ d6C0der and 4,16 **> The 

video formatter ma^ge tteVrSve^^ ^ Way Wh ' Ch the tem P° ral decod * r «nd 

b-^ssssss^ &di r r y interpoiated < b) b *-«• « 

no further decoding by the temporal Si but m^ h» E^T? ° ne ,r ° m the ^ ' P ictures 

decoding P and B pictures De^naTp^^l r^ , ^ °" e ° f the lwo ,rame store buffere *» '*er use in 
The decoded P Xre is storSln a &5Z2?^£T"- predicbons ,rom a P""*"* decoded P or I picture, 
require predictioJsTom £E5£ aS^™ decoding further P and B pictures. B pictures can 

It will be appreciated that I and P 5J£E£2 «■ I^T* fJ**"™ *'* n0t S, ° red in the frame store bu «^- 
I and P picture?^ ZSml^S^SSSi s£r SJT the h ,em P oral ^oder as they are decoded. Instead, 
picture arrives for decoding In other words E^EEiEES ^ OUt ° n,y When a subsequent I or P 

pictures out of the two p^^^SE^TEtE^ I"* °" subsec ' uent P or 1 P j ««^s to flush previous 
P pi«ure when it is neces*^ toJLsh th SKiEE?*!?*" ° f ^ 1X686,11 inVenti ° n provide a ,ake 1 or 
when a subsequent video Suence begins ' ^ ' rame St0re buffere - turn ' ,nis fake is flushed 

As shown in Table 14. the picture frames are displayed in numerical order. 
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Table 14: Frame Stores 
milted in the o^ & \teto^Z^^£^^^l f" " intra frame (l frame >" ^ 1 fra ^ a ^ trans- 
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Np*t all the B frames to be displayed between the P4 and P7 frames are transmitted, i.e.. corresponding to B5 and B6. 
Then t" US .10 is transmitted. Finally, all the B frames to be displayed between the P7 and .10 frames are 
transmitted corresponding to B8 and B9. This ordering of transmitted frames requires only 2 frames to be kep in menv 
^^mSSJm at any one time, and does not require the decoder to wait for the transmission o the next 
P f r^e or I f^ame to display an interjecting B frame. As described above and shown in Table 1 4. the temporal decode 
of the present invention can be configured to provide MPEG picture reordering. With this 

of P and I pictures is delayed until the next P or I picture in the data stream starts to be decoded by the temporal 

decoder. , _ . 

As the P and I pictures are reordered, certain tokens, i.e. Picture_Start. Picture.Type. and Temporal Reference, 
are stored temporarily on the chip as the picture is written into the picture buffers. When the picture .s read out tor d.s- 
p ay Tie stored tokens are retrieved. At the output of the temporal decoder, the DATA tokens of the newly decodec P 
or I picture are replaced with DATA tokens for the older P or I picture, and they are then sent to the v.deo formatter. Note 
°hat the output from the temporal decoder is in tokenized macroblock format and there is no block-to-raster conversion^ 
In brief the video formatter of the present invention stores two framestores or pictures. In some video formatters 
three pictures or framestores are used to accommodate such features as repeating or skipping pictures. The v.deo for- 
matter's off-chip DRAM holds three framestores. The use of three framestores here allows frames to be either repeated 
or skipped in situations where the frame rates of the decoded video and the display are different. 

All I B and P frames are stored in the framestores of the video formatter. At any one t.me. there may be one frame 
store from which data is being displayed, one frame store into which data is being written, and in video formatters w.th 
three framestores. one other frame may be being stored in the third frame store. 

The present embodiment performs all the prediction, reordering and block-to-raster tasks MPEG normally handles 
bv using a temporal decoder with two framestores and a video formatter with two framestores. i.e for a total of four 
framestores. This is accomplished in the present invention by using a frame store sharing scheme that only uses .three 
framestores. The present embodiment cannot, however, handle the repeat and skip frame tasks of a video formatter 

with only the three framestores. . . 

The present invention stores I pictures in a first frame store and P pictures in a second frame store Because of the 
need to perform the block-to-raster conversion. B frames are stored in the manner detailed below in a third frame stora 
In order to minimize the amount of external DRAM required, a scheme is used where successive B frames share the 

Sam Whena fr BTraSe is decoded, it may refer to the two previously decoded I or P frames occupying the first and second 
framestores The decoded B frame is written into the third frame store. The present embodiment allows the raster to 
commencG prior to a frame store being completely filled. The raster is allowed to start before 

that the next B frame can be written into the same frame store to occupy the space vacated by the raster at the top of. 

thS border to keep a record of which parts of the frame store are occupied with picture data, and which are available 
fa hew data each frame store is split into sectors. In the present invention, each frame store is first spirt into two field 
stores, each'of which comprises N sectors, where N is the number of block rows in the f .eld. 

Frames coded as field pictures are straightforward. Each successive macroblock row occupies two sectors in a field 
store Once the write back has progressed far enough down the frame, the raster starts reading out each ^sector from 
40 the top. Once the write back of the first frame has been completed, the start of the next frame is wrrtten into the space 
left by the raster. Checks on the status of each of the sectors ensures that the sector to be rastered is indeed full, and 
that for write back, the two sectors required are empty. . 

Frames coded as frame pictures are more difficult. Unlike field pictures, the macroblock rows of data are not written 
to the DRAM in the same order as they are to be rastered. The field stores are written to in parallel, whereas the fields 

45 are rastered in turn. , . _„^ u 

Consider a picture with 8 sectors per f ield store. That is. Field store 0 consists of 8 sectors numbered 0 to 1 each 
of which contains one row of blocks (i.e.. each 8 pixels deep by the width of the picture). Field store 1 consists o 8 sec- 
tors numbered 8 to 15. each of which contains one row of blocks (i.e.. each 8 pixels deep by the width of the picture). 
'The first macroblock row is wrrtten back into sector 0 in field store 0 and to sector 8 in field store 1 . The field stores 

so continue to be filed in parallel. At some point, the raster beings displaying sectors from field store 0. that point being 
chosen so that the raster of field store 0 does not catch up with the write back. However, the second frame cannot be 
written back in the same manner as the first. Because the sectors are written and read in a different order, waiting for 
the same two sectors to be free at the start of a frame would mean that write and read could not run continuously. This 
must be achieved in order to maintain the display and to maintain decoding at the necessary rate 

55 Accordingly, the second frame must be written into sectors of the frame store already freed by the raster This is 
implemented by dividing the framestores in two. Hence, for the second frame, the meanings of the half field stores 
change Sectors 4-7 become the upper part of the second field store and sectors 8-1 1 become the lower part of the first 
field store i e they swap over. The first macroblock row is written to sectors 0 and 4. once they are freed, with subse- 
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quenl rows written to 1 and 5, then 2 and 6 and then anrt 7 Th a „^ 

The FIRST picture is written back to - 
Sectors 0 and 8 [1st macroblock row=2 block rows] 
Then 1 and 9. 2 and 10, 3 and 1 1, ... 7 and 15. 
The FIRST picture is rastered from - 
10 Sector 0, 

Then 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15. 
The SECOND frame is written to - 
Sectors 0 and 4, 

Then i and 5, 2 and 6, 3 and 7, 8 and 12, 9 and 13, 10 and 14 11 and 15 
'5 The SECOND frame is rastered from - 

Sector 0, 

Then 1, 2, 3 8, 9, 10, 1 1, 4, 5, 6, 7. 12, 13, 14, 15 

picture sizes which dKride into a number of sectore ZT^^n^tl ll ^ * ,m P ,ement - Secondly, with 
from sectors of memory does no* SiSKtSS^ ,n ^ ^ " "** * «"*'•■** 

would repeat every 58 frames). This'makes JmSSi^SSSt " d ^ 

to be read. Thus, each fifo being ^ o^ c^^ela^r!?' P ° in,efS tC> ,hS neXt ,OCation to be writte " ^ 
of the knowledge that each ha^ie,^ ™* — « 

o the t^sxrriszs sets? j* Je ° j — — ^ h 

frame in a second f^me sto7i o^idtn a ? f T St ° rin9 ,he ' Frame in a f irst Frame ^re. storing the P 
f ie.d store being J5S aniZl^^^ ^ and S6C ^ d 

B, Frame from a selected portion of the nJ^2tS^oSSriMiT *" re9iSter ' *» 
into the selected portion of the memory area from whw? *T n c ,0 e; wnta1 ° a por,lon 01 the 62 Fr ame 

* can be used to store video hS ' ^ WaS read; whereby a reduced amour * <* memory 

The two programs found herein be.ow contain code to be used in the preferred embodiment of the invention. 
Detailed Description of the Invention for a Parallel Huffman Decoder 

programmable state machine^) ( ^ ^ thr ° U9h ,okens under ~"trol of the parser micro- 

This embodiment of the present invention handles both MPEG-2 as well a* mpp« 1 u m ^ . 

irregular or non-canonical in nature. MPEG-2 transform coefficient table which is 

Furthermore, this embodiment of the invention has some features that allow t« ^ , ■ 

components from the stream in a sinole cvelo »,rth™ rt «, . , rt t0 decode cer *am more complex 

plex components are aSSSJlSSSSir^^T*^? ^ eX,ema ' C ° n,r0,,er - Exam P ,es of sucn «m- 
the streanv as ccnt^J^S^^ST ^ ^ a " ° f Which are " resent in 

specie .engfh ofjhe FLC. Thus, the ROM « £2 STSJKl^ ** ^ " ^ * 

»e beg inning of the ne* ^^X^ 
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bits ol its 59-bit input and the top 16 bits of these are passed to the Huffman Code ROM 302. For subsequent VLCs. 
the selector effectively shifts the input according to the total count of bits decoded thus far. The count is maintained by 
addina the size of each VLC. as it is decoded, to a running total. The various word widths are a result of the maximum 
coded size wh,ch can be decoded, which is the 28-bit MPEG-1 Escape Coded Coefficient, and the maximum VLC size 
which is 16 bits (DCT coefficient tables). 

The "table select" input is used to select between the various different Huffman code tables required by MPEG. 

The Huftman Code ROM 

The core of the implementation of the present invention, used to decode all the VLCs is a special ROM 302 whose 
addresses are controlled with a selector/shifter 301 as shown in Figures 52 and 53. The ROM 302 has the job of per- 
formina a VLC table index calculation, followed by the index-to-data operation that yields decoded data. 

The index calculation can be thought of as a content addressable memory (CAM) operation with "don't care 
matching implemented to handle the Huffman codes which form the presented data. Since all the VLC code tables are 
fixed a CAM-ROM w.n suHice and this is the job of the ROM AND-plane shown in Figures 54 through 57. Since the 
index generation .s pe.to. med in a look-up manner (rather than algorithmically) there is no restriction to handling tables 

which are canonical , . * . 

The ROM Or -plane converts the "index" (an activated word-line) into the decoded data and the size (or length) of 
the code The data forms the decoded output (subject to error checking) and the size information is fed back to allow a 
calculation to be performed wh.ch controls the selector and. thus, presents the decoder ROM 302 with the correct data 
to perform the decoding of the next VLC in the subsequent cycle. 

The ROM 302 address of the present invention is in two fields. The larger field is the bit-pattern to be decoded and 
the smaller field selects which Huffman code table is to be examined. The bit-pattern which must be examined is quite 
long 16 bits, correspond,^ to the longest VLC code and there is an additional 4 bits of table select. Thus, there ns a 
total' address space of 20 bits (approximately one million addresses) although there are only in 450 entries in the ROM 
302. The reason for the difference is due to the existence of "don't care" bits. „ TW „ 

In order to decode VLCs the AND-plane must be able to decode "don't care" bits in the VLC bit-pattern. This is 
because all VLCs which are shorter than the maximum 1 8 bits will be followed by additional bits which form no part of 
the decoding of that VLC. Because of the wide address, the AND-plane is predecoded (2->4), and the ROM 302 must 
combine "dont care" handling with this predecode. Furthermore, in addition to the complete MPEG code tables, the. 
ROM 302 also has entries to identify illegal VLC patterns, which exist for some code tables. 

Maximizing Throughout 

in order to sustain output of one decoded item every cycle, some care must be taken to control the decoder input and 
special handling must be used for some "complex" symbols (i.e. . ones which are not single PLCs or VLCs). • 
In order to sustain peak throughput of Escape-coded coefficients it must be possible to input at least one complete 
code per cycle. Since the maximum length required is 28 bits in MPEG-1 this dictates the input word width of 32 bits 

(being the next sensible size greater than 28). ^ - » , 

Normal transform coefficients are also "complex" symbols, in the sense that they consist of a VLC followed by a 1 - 
bit FLC which gives the sign of the level value and are handled in a similar manner to the other complex symbols (e.g. 
motion Vectors Intra DC and Escape coded coefficients). Peak throughput cannot be achieved if coefficients are 
decoded as a VLC followed by an FLC (in separate cycles) and the alternative of allowing the ROM 302 to decode the 
sign bit would double the size of the two largest tables in the ROM. Thus in the present invention, special handling is 
used for various symbols so that a single cycle can produce the "final" required result. 

R P,s and Tokens 

The basis of FLC handling is to control the selector with the required length of the FLC and to bypass the ROM 302 and 
simply output the correctly selected FLC. Thus, simple FLCs are handled fairly naturally by the decoder, without signif- 
icant extra hardware. Furthermore, tokens are not manipulated, but simply passed directly to the output of the decoder. 

Implementation 

This section describes several irrportant features of the implementation of the decoder., in accordance with the present 
invention. The implementation includes the arrangement of registers with the counter 303 and selector 301, as shown 
in Figure 52, and the actual code ROM. 

The schematic of Figure 53 shows how the core components are interconnected to implement the mam Huftman 
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plane where predecode and SVlSSlS a? ^SET . T^* fe3tUreS ° f ,hiS ROM 302 ,ie in «» AND- 
codes. ^ 09 3re USed t0 ,m P |em ent a method of decoding variable length Huffman 
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shown in the casetrThe" S?x IS SSS.to ^relevant predecoded address lines, in parallel as 
using a parallel connection of ^^I^ST^T l°T ^ reqUired deCodin9 is achieved with °"t 
selected. They are combined TfSS^^^^^^ 3re P erformed both of which must be 

if both of the selects are aSe * 9 6 " WOnd " ne driV6r Such ,hat the ^line is only activated 

practice the invention with all of its attendant TZ Jf l^T ^ ° ne ° f ° rdinary sW " in * e art to mak « ™<* 

more detailed in depth undlrs^ a further. 

applet th0Sa ° f SW » in *• a * are included with this 

invention is intended to funrtio^ 9 deta ' ,ed and ° peration of *• environment in which the present 

in JSES^^ 3 existing need for further improvements 

a P .ura,ity outages Me^^^Z^Z^eZ^STs T*" and 3ppara,US 

and DATA Tokens pass over the sinole twTwTrJ '£222 f£ 9 u P P Pressing machine. Control tokens 
decode circuit is posHion^ n certain of7e Carry ' n9 * and data in toke " *)rmat A token 

that stage and ^ nrecS "icontSuo^ ° P - "- 01 1,16 t0kenS 35 tokens Pe^nent to 

tioned in selected Vteges a^^reSSl ™ a'ongthep.pel,ne. Reconfiguration processing circuits are posi- 

identified DATA TbS? A w^e ^IIZS^VS!!^ t0ke " ** reconR ^ s ^h stage to hand.Van 

dosed for i-Plementing^^r^ 'echn.ues are dis- 

b^time sy.chroni.tion. asynchronous swLg buffering. 

dJib"'. -io^^ nave been illustrated and 

His not intended that the invention be ^T^X^^^SLT ^ °' ^ ^ ACCOrc,in9 ' y ' 
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Start Code Detector 

Figure 59 shows the start code detector 201 (SCD) interfaces with other blocks of circuitry of the system in accordance 

Te SC^canT^m of as providing three distinct functions. First, the SCD 201 provides an input circuit that 
reives data either from dedicated pins or from the MP. 320. Second, the SCD 201 detects start codes .n the data, and 
third the SCD provides the necessary circuitry to assemble the incoming data into a format to be used .nternally w,th.n 
the coded data buffer (CDB) 321. 

Parser 

Fioure 60 illustrates the parser subsystem, in accordance with the present invention. Data which was formatted for the 
CDB 321 i" un^cked and passed to the parser which receives instructions from the MP. 320. Thereafter, the data .s 
passed via a two-wire interface to the rest of the system. 

Spatial Processing 

Figure 61 illustrates the components of the spatial processing circuitry. These components include an inverse modeler 

(.model) 325. an inverse zig-zag (IZZ) 326 and inverse quantizer (.quant) 327 and an ;™% se *^^ 

(IDCT) 328. The data passes into the Imodel 325. then to the IZZ 326. next to the Iquant 327 and then to the IDCT 328. 

Display Circuitry 

The display circuitry of the present invention is shown in Figure 62. This system includes a vertical upsampler 210. a 
horizontal scale subsystem 331. an output multiplexer 332 and a video timing generator 333. 

Parallel Start Code Detector (scdp) 

Th P start code detector 201 in accordance with the present invention, is a parallel start code detector, i.e., it passes 
Jata USSH This system is similar to that previously disclosed in British Application Serial No. 940591 4.4 fi.ed March 
TSSTSiSo /Jpplicaation Seria. no. 92306038.8 fi.ed June 30. 1992. (hereinafter "Bro..y"). However severa 
maior differences exisTbetween the two start code detectors. First, byte alignment is assumed. There is no sh.tt.ng of 
me Ita in order to find start codes in the present invention. Second, the present invention operates pnmar.ly w.th 

aTmPEG (1 and 2) start code consists of a unique bit (byte pattern) in the bit stream known as the startjcode_pref ix. 
The pattern is 23 zeros followed by a one. The 8 bits immediately following the start_code_pref.x is known as the 
Hart ccS value This indicates the type of the start code. Start codes arriving at the SCD of the present .nvention are 
required tobe byte aligned. Accordingly, the above data can be specified as a byte sequence. For example: 

0x00 
0x00 
0x01 
0xb8 

is a group_start code. 
Input Fifo 

The present invention is designed such that given a peak data rate of 250 Kbytes/s and assuming that the codec I data 
buffer does not overflow, the in.accept pin will never be pulled low. Hence, to calculate the length of the ,nput f rfo ,t .s 
necessary to know: 1) the worst case wait time for a swing buffer, and 2) the worst case data expans.on through the 
SCD 

With the input data arriving at the coded data clock rate, in accordance with the present invention, scdp will generate 
two stalls per start code (having removed three bytes from the data stream). 

Input Circuit 

* 

The input circuit of the present invention performs exactly the same way as is disclosed in Brolly. However, there are a 
few differences of note between the two circuits. First, the upi wonl be made to wait until a valid end of a token ('cause 
this may never set). Instead, it will be made to wait until a signal ln_token is low. Second, generation of the DATA 
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header, when entering byte mode, depends on there being some byte mode data. 
Start Codes 



5 in inV6nti0n ' MPE ° ^ COd6S ^ reCOSni2ed and co ™ rted <° *— ^ the SCO. These are shown 




Table 15. Start Code Values 
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Removal of Bit Stuffing 



stuffing bits 
OxOO// 8 stuffing bits 
0x00 
0x00 

0x01 //start_code_prefix 



removed. 



Search Modes 



The search_modes, in accordance with the present invention, are described as follows i 



in Table 16: 
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Search mode 


Operation 


0 


Norma! Operation 


1 


Search for picture_start or 
higher 


2 


Search for group_start or higher 


3 

* 


Search for sequence_start or 
higher 



Table 16. Search Modes 

Anv non-zero search mode causes all arriving data to be discarded until the desired class of start code is found At this 
point the search mode is reset to zero, and a start_code_search interrupt may be generated. A new control brt. 
stoplon.search. determines whether the SCD actually stops after generating the interrupt (the interrupt is also 
masked in the usual way. but stopping is not mandatory). 

mThe present invention search_mode is also set to zero if the SCD receives a FLUSH token. However, when the 
FLUSH token terminates discardall. searchmode is completely reset, i.e.. search_mode is reset by the combina- 
tion of a FLUSH token and discard_all. 

Non-aligned Start Codes 

Any run of more than one zero bytes followed by a 0x01 is a start code. Furthermore, any run of greater than 23 zeros 
NOT followed by a one is a non-aligned start code. In the byte aligned world, this translates to: If. after removal of bit 
stuff ing. 0x01 is not received, then the start code was non-aligned. Note that this statement actually misses some non- 
aligned start codes (where less than a byte of stuffing was involved). 

Rather than going to the effort of describing in the data sheet which classes of non-ahgned start codes are 
detected, the scdp of the present invention ignores them. In other words, stuffing is still removed. 

* * 

Overlapping Start Codes 

It is possible for the "value" part of a start code to form part of the "prefix" of a subsequent start code. This typically 
occurs for two reasons: 1) the standard allows system level start codes to occur anywhere in the stream - tnclud'ng 
directly in the middle of a video level start code, and 2) errors. Removing all erroneous looking start codes until the last 

one provides a better chance of error recovery. , . 30 

In the byte aligned environment, in accordance with the present invention, the only way an overlapping start can happen 
is if a picture start (value = 0x00) forms part of another start code. In this scenario, the picture.start is removed from 
the data. andlhe second start code is decoded. If this, in turn, is overlapped then the same procedure applies until a 
non-overlapping start code is detected- 
Unrecognized Start Codes 

In the present invention, the reserved values (OxbO. 0xb1. 0xb6). all system start codes (0xb9 to Oxff). and the 
sequence_error code (0xb4) are each treated as unrecognized start codes. After removing the unrecogn.zed start 
code, the SCD discards all incoming data until the next valid start code is found. It will also set the unrecognized_start 
error register and. depending on the unrecognized_start mask, will generate an interrupt. 

Extension and User Data 



Two configuration bits are used in the present invention: 
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1) Discard_user (or not) 

2) Discard_extn (beyond MPEG2 main profile, main level) 

Both of these configuration bits are reset to ONE 
MPEG2 extn start codes are different. The four bits following the extension start value are now an 
exlens-on^start.code^dentifier and must be decoded by the SCO. Four new tokens are geneS to flag thesT tS 
aNowed exlens,on^tar L cod eJ dentifiers and their respective tokens are shown in iLe Z ' SSrS 
exiensron^start code identifiers are not recognized. Unrecognized extension start codes are Xr <SS 
(depending on Discard.extn), or replaced with the (old) extension data token " discarded 
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extension_start 
code identifier 



Name 



New Token 



0000 


reserved 


0001 


Sequence Extension ID 


0010 


Sequence Display Extension ID 


0011 


Quant Matrix Extension ID 


0100 


reserved 


0010 


Sequence Scalable Extension ID 


0110 


reserved 


0111 


Picture Pan Scan Extension ID 


1000 


Picture Coding Extension ID 


1001 


Picture Spatial Scalable Extension ID 


1010 


Picture Temporal Scalable Extension 



SEQUENCE_EXTN 

SEQUENCEJDISPLAY_EXTN 
QUANT_MATRIX EXTN 



PICTURE_CODING_EXTN 



Head 



Oxeb 



40 



45 




Table 17. MPEG2 extension_stari_cocle_identifiers 



so 
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Insertion of PICTURE_END Tokens 



None of the current standards (MPEGt.2, JPEG, or H.261) specify a way of ending a current picture 

a pTt^ 201 maintains a piece of state called in_picture. Thfc state is set whenever 

pidu?e sfa^ 5y ? SCD 201 ' ^ y start that is higher in the syntax than 

picture_start (or a FLUSH token) causes the generation of a PICTURE_END token. The PICTURE END token is aen- 
erated and output before any tokens associated with the new start code. in_plcture is reset when the PICTURE END 
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token leaves the SCD 201 If the SCD 201 receives tokens in the input data stream, the action is logically identical - 
including receiving a PICTURE_END token. In summary, the start codes (and tokens) which may cause a 
PICTUREJEND to be generated, in accordance with the present invention, are: 

picture_start_code OR token 
5 ' grotp_start_code OR token 

sequence_start_code OR token 

sequence_end_code O R token 

FLUSH token 

io Stop After Picture Interrupt 

The stop after picture (sap) feature is of the present invention functions to facilitate a clean way of finishing off a current 
sequence, e.g.. channel change. It is necessary to accomplish this function as automatically as possible and without 
the need for external real time software. 
75 The sap control bit is referred to as a f lag_picture_end. 

There are two control bits in addition to the flag_picture_end, mask, and error bits: 

1) after _picture_stop: Determines whether, after generating the interrupt, the SCD stops. 

2) after jldure_dlscard: Having generated a flag_picture_end interrupt, this bit determines whether scdp auto- 
matically goes into discard_all mode. 

In this way. discard.all mode doesn't need to know what event called it and it is possible to leave the discard_a!l mode 

and to proceed to a search mode quickly and cleanly. 

In accordance with the invention, whene/er a PICTUREJEND token is output by the SCD, the f lag 
determines whether any action is taken. If f lag _picture_end is set. a FLUSH is generated after the PICTUREJEND. 
and the event is generated. Interrupting depends on the flag_picture_endjnask. and (having interrupted) stopping 
depends on after _picture_stop. 

By way of example, for a channel change, the sequence of events is as follows: 

1 ) Set f lag_picture_end with after j3lcture_stop = 0 and after _picture_discard = 1 

2) Respond to f lag_picture_end_event. 

35 a) Set search mode to sequence (for example), 

b) Retune, etc. 

3) Either FLUSH or s/w resets discard Jill. 

40 4) scdp searches for the start of the next sequence. 
discard_all 

An R/W control bit, discard_all. causes the scdp of the invention to discard all input up to and including a FLUSH token. 
This bit is automatically reset by a FLUSH token and may be set by the f lag _picture_end function. 

Tokens Recognized by scdp 

While the primary function of most of the scdp of the present invention is related to actual token generation, there are, 
several tokens which when applied to the coded data port (or via the input circuit) are decoded and acted upon by the 
scdp. Table 18 illustrates and defines these tokens. 
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w 



15 



20 



Token 



Header 



Action 



FLUSH 


0x17 


Flushes scdp 


1 1 icoy luKens may 


PICTURE_START 


0x12 


Sets in_picture 


cause the generation 
of a PICTURE_END. 


PICTURE_END 


0x16 


Resets in_picture 


In this case, they 
would reset 
in_picture and may 


GROUP_START 


0x11 




SEQUENCE_START 


0x10 




cause a 

fl a 9_picture_end 


SEQUENCE_END 


0x14 




event and a FLUSH 
to be generated. 


DATA 


0x04 etc. 


Data is searched for start codes 


Other 




Unrecognized tokens are passed through 
scdp unchanged 
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Table 18. Recognized Input Tokens 



Scdp Memory Map 



30 



The various registers and their associated addresses for the scdp of the present invention are described in Tabie 19. 
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Register Name 


! Bits 


| Reset 


flag_endJof_picture_mask 


(0) j 


0 



Comments 



Address 



Table 19. Parallel Start Code Detector Memory Map 
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DataFlow Around the Coded Data Buffer 

15 The present invention provides the following advantages: 

1) A method of forcing the buffer to swing. 

2) A way of avoiding having to pack bytes into an odd number of bits. 

3) Reducing the width of the (potentially long) bus of the SCD down to 8 bits. 
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Introduction 

3s This section defines the handling of time-stamp information.* accordance with the present invention. 
Theory of Operation 

time is aecoaer nas, at all times, a notion of what the current 

that follow that Sre taJJ^S. " P ' CtUre ° f W " be 006 more •« the number ° f ^-pictures 

The important complication to appreciate is that the DT^ anH ptq *~ „ ^ ~ *w , , 

— . ^ ^r^ T r^;rr^^sss 
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should display pictures (defined by the time-stamps and the clock references) This modification will depend^ on , the 
deSls of The architecture of the decoder. Clearly any delay introduced by the video decoder must be matched by an 
equivalent delay in the audio decoder. 

Discontinuities 

Discontinuities in the concept "time" may occur. For instance, in an edited bitstream each edit point wHI have discontin- 
ues time. A similar situation occurs at a channel change. Care must be taken because using a I^M« 
encoded in one time regime with respect to a "time" defined by a clock reference from another reg,me will clearly lead 

to incorrect results. 
Start-up 

A particular problem occurs at start-up (or channel change) because there are two potentially competing r^uirements 
for staSg to decode con ect.y For video considerations, it is now necessary to start decoding w, h an .-picture that fo - 
owl a system head* (th.s may not be true in all situations, but is .argely a correct statement) but for system consider- 
ed the L decoded p.cture ought to carry a time-stamp. However, there is no requirement that every p,cture carry 
a fime stamp .and. theXe it ,s possible that one may wait for ever if they try and look for a p.cture that is both an I- 

O^^L^ZZ* .he time-stamp would have been for an .-picture from a picture that precedesK that 
does have atne stamp Unfortunate* this is very diff icuH to do because it would be necessary to partajy decode £. 
frterlening pictures ../determine whether they are field or frame pictures (and whether repeaU-rstJ .eld .s set). This 
requires that the data go through the coded data buffer and be discarded by the Huffman Decoder. 

Embodiment 

Figure 63 shows a first embodiment for implementing time-stamp management. The clock references ,253 ;are decoded 
by the system demultiplex 254 of the present invention and placed into a counter 255. incremented at 90 kHz. that rep- 
resents are also toaded into a second copy of the counter 258 that is located ins,de the video decoder 270. 
T^e^tarr^s «ow through the video buffer 271 so that they are delayed by the same amount as me video data. 
ThLe are then compared with the local copy of time to determine whether the picture is too early or too late. 
IS TerSSn^ordance with thepresent invention, is shown in Figure 64. This avoKs the need for ft. .do* 
references 253 to be passed to the video decoder 270. This is achieved by using a second counter vid_t.me 272. 273 
wS is mainfained both in the video decoder 270 and the system decoder 256. They are 

free run from there on Since this embodiment requires that the two counters stay in step, it is necessary to take steps 
to ensure they T do not get out of step. This can be accomplished using carry out of the counter ,n the system demux to 

£r?£££ t&ISSXZSSi there is no need for the ful, 33 bits of the number to be dealt with. The 
foea^woufo ^e to restrict the counters to 16 bits to a..ow 16 bit handling on the video decoder 270. Although this would 
appearto represent an insufficient number range at a resolution of 90 kHz (on.y 2/3 second), there .s no need tor such 
511 because on the video decoder, the time control is on.y accurate to a field time e.ther way since the VTG 
free-runs (or is gen-locked to something that has nothing to do with the MPEG stream being decoded). 
M a resuU i seems that the .ower oraer few bits of the time-stamp going to the decoder can be discarded^ the 
present ^^nvention four bits are discarded. This means that the video decoder uses 16 bits of a 20 b,t number. The res- 
olution is thus. 5625 Hz and can represent a time difference of 1 1 .65 seconds. 

Therefore a PAL field is 1 12.5 ticks of the 5625 Hz clock. An NTSC field 93.84 ticks. Hence, it is still poss.ble to achieve 
timing calculations to an accuracy of about 1% of a field time which is adequate for the present invention. 

Hardware 

Figure 65 shows the hardware in accordance with the present invention. There are two modules in addition to those dis- 
closed in Brolly. The f irst is added just after the start code detector 201. ft is responsible for generating a token. A 
TIME STAMP token occurs just before a PICTU RE_START token. In the MPEG systems stream, the time-stamp is car- 
ried in~a packet header and refers to the f irst picture in the pa*et of data. Since the packets do not line up > w.ft the video 
t data there will, in general, be the end of the previous picture before the start of the p.cture to which the t.me-stamp 
refers 

The time-stamp information may be supplied to the system of the present invention either via the microprocessor inter- 
face or by using a Token. In either case, the time-stamp data (16 bits) is stored in a register. A flag is set to indicate the 
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toKent SSSSfi 5T ST re9iS,er ' *" W3S SUPP ' ied *• ™ E -^AMP token then that 

flag indicates that valid VhmS^SS^^S^ J? TT"* T * UnafcG,edL however - the 
before the PICTURE START token TlS tea therf^^ i ^ff* ,hen 3 T,ME - STAM P token is generated 
The second hardware mod" e Yslssodat ^11^2 T ' S aVa " ab ' e for me nex1 t^e-stamp that occurs, 

counters clocked from ^27 ZhIT^^Z^IT I T MaChin6 218 7,115 iS sim "* a se "* of 
option shown in the diagram is Lr^l^^^n^ T " ^ 48 ° 4 8 

?zrssr is ,he ,ime coun,er and is — tj e itzes^s*. ^ „ is reset ^ 

feedback flip-flops (SY^s, which are 

cern that the time counter in Brian miglS get out o^^ <™ S is b — - «* 

.^TKKSSS SSL"fiSrjSK SIT* ? 6 - - CoWare 

the time at which if Thou.c ?be de^fCi^s * there '° re whe,her « is ear * « «npS to 

The registers for use in the SCD 201 relating to time stamps are shown in Table 20. 



\ 



62 



EP 0 891 094 A2 



Register name 



tsjow 



CO 



8/rw 



tsjiigh 



8/rw 



ts valid 



l/rw 



O 
iS 

CO 
0) 



Description 



The lower eight bits of the time-stamp value. 

This register is slaved so that new values may be written into 
this register without affecting the value previously written (that 
will become part of a TIME_STAMP token). 

Writes to this register affect the master register whilst reads 
read-back the slave register. Until a master-to-slave transfer 
has been effected using ts_valid. the value written into 
tsjow cannot be read back. 

The upper eight bits of the time-stamp value. 

Slaved in the same way as tsjow. 

This bit controls the master-slave transfer of tsjow and 
tsjugh. 

When values have been written into tsjow and ts Jiigh the 
microprocessor should write the value one into this bit. It 
should then poll the bit until it reads back the value one. At 
this point, the values written into tsjow and tsjiigh will 
have been transferred into the slave registers (and can be 
read back) and ts_waiting will be set to one. 

The microprocessor should then write the value zero in 
preparation for the next access. 
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Register name 



CO 



ts_warting 



0) 
to 
0) 



Description 




When set to zero the registers tsjow and tsjiigh do not 
contain valid time-stamp information. 

When set to one the registers tsjow and tsjiigh contain 
valid time-stamp information. A TIMERS TAMP token will be 
generated before the next PICTURE_START token and 
ts_waiting will then become zero. 

This bit should be polled to ensure that it is zero before 
writing a one into ts_valid to ensure that the previous time- 
stamp value has been used before rt is overwritten by the 
master-to-slave transfer. 



Table 20. Time-stamp "SCD" registers 
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MSM Handling of Time-Stamp Information 

^E^Sp d SIn fUnCtl ° n ° f 218 ' ^ aCCOrdance with the * resent lnvention ' when » reives the 

First, a 16-bit signed time-stamp correction is added to the time-stamp that was carried by the TIME STAMP token 

J 0 " ,S ^ t0 2er ° by the MSM 218 3t Chlp " reSet and ' " no action is taken ' the time-stamps are unaltered 
™ e contro,,,n 9 microprocessor may. however, write any value into this register to modify the time-stamp and therefore 
compensate for differential delays through the video and audio decoders meretore. 
Next, the corrected time-stamp is subtracted from the current time. The sign of this gives the direction of the error (and 
22 ZTt err °^ e L jf any ' 9enerated *>y the MSM 21 *>- absolute value of the difference is then taken and 

rl 2 I ° 0mpar f k° frame " me - " reSUft iS ,6SS than me frame time ' no "b taken. As previous.y d™ 
cussed, time can only be controlled to an accuracy of plus or minus a frame time from the nominal time because the 
v i o tree-runs. 

!l?* Pr T'l- inV ? nti » n /.i! 6rr0r 6XCeedS 3 ,rame -* ime - then so™ correction must be made. The MSM 218 can cor- 
^r »^H J ? I 9 iS ,0 ° eaHy 8ince * 030 Simp,y de,ay * e decodin 9 un *' lhe appropriate time. How- 

S ,'JL i w ,'l 9 ,S f ,6r tha " the imend6d time> 1hen ,his is more "W""" becaus * » not possible to discard 
pictures ratably at the output of the coded data buffer. Essentially, the decoding of the sequence is broken and the rnost 

chTae t 9t r 4,00 T deCOdin9 PrOCeSS 3 ™ simi,ar to -ndom-access or channe. 

^.Tflu^ the contro1 re9ister °' the MSM 218 may be pro9rammed to discafd 311 da * 

Start-up 

l^i^n ! 8 ' ac ? 0rdance with 1,16 P resent invention - reives a time-stamp at a time which it recognizes as a start- 
mcSS *™™ & T el f0,l0Win9 8 SEQUENCE -END token or FLUSH token and it is stufbefore the fKt 

^Ir?^Z 1 ?h l aCt, ° n °' *" MSM 21 8 be m0dif ied - f the time " stam P indi <*'<* decoding should 
have occurred earlier than the current time, then the situation is handled in the same way as detailed above However 

rf the t.me-stamp .nd.cates that the decoding still remains to take place after the current time (which is the normal situ- 
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ation on start-up), then the decoder will wait until the correct time even if the error is less than one frame-time. In this 
way it is possible to set the nominal decoding time as accurately as possible to the correct time. Subsequent pictures 
may then be decoded, up to one frame-time before or after their nominal time, without any error situation being trig- 

fnSition, in the present invention the error "ERR_TOO_EARLY rt is not generated during start-up (since it is expected 
that decoding would be early) irrespective of the setting of disable_too_early. 

MSM Time-stamo error codes 

As a result of the time-stamp handling, it is possible that one of two errors will be generated. 
ERR TOO_EARLY is generated if the decoding is taking place earlier than the time indicated by the time-stamp. 
ERR~TOO LATE is generated if the decoding is taking place later than the time indicated by the time-stamp. 
ERRjrOcTEARLY may be suppressed, but E R R_TOO__ LATE will always be generated unless all errors are masked 

Table 21 describes the various time-stamp registers associated with the Microprogrammable State Machine, in accord- 
ance with the present invention. 
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Register name 



ts_correction 



frame time 



time 



manual_startup 



decode_dtsable 



Q 

CO 



16/rw 
16/rw 



0) 

CO 

"53 

to 

cr 



Description 



226 or 
188 



16/ro 
1/rw 



1/rw 



NTSC 30 



1/rw 



zero 



zero 



zero 



dtsabl«_too_««r1y j 1/rw zero 



zero 



Correction added to each time-stamp before it is used. 



Represents the tolerance on the timing of decoding pictures. 
Reset state determined by the PAL/NTSC pin. 



Reset by either reset or time.reset. T he current value of time. 

When set to one. the startup is to be performed manually using 
decode_disable. In this case, SEQUENCE_END and FLUSH 
tokens at the MSM cause decode_disable to be set to one. 

When set to zero, startup is performed using the time-stamp 
management hardware. Decode-disable is never automatically 
set to one. 

When set to zero, the decoding proceeds normally. 



At the start of each picture, the MSM checks the status of 
decode-disable and will not proceed if it is set to one. 

Note that if manual start-up is to be performed (i.e.. without the 
time-stamp management hardware) this bit should be set to one 
at the same time as manual-startup is set to one. 

When set to one. the error "ERR_TOO_EARLV indicating that 
the decoding is too earty is suppressed and the MSM simply warts 
to correct the situation. 



When set to one. the prescaler divides by 4804.8 rather than 
4800. Set automatically when decoding 30 Hz frame rates. 
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Register name 



discard_KJate 



CO 



1/rw 



discard_until 



2/rw 



0) 

2. 
"53 

to 

o 
a: 



zero 



Description 



This has no effect unless an "ERR_TOO_LATE" is generated (or 
would be generated if errors were not masked out). If it is set to 
one then data is discarded until the condition indicated by 
discard_until. 

Indicate the condition which causes time-stamp triggered 
discarding to be terminated. 

0 - FLUSH 



1 - SEQUENCE_START 

2 - GROUP.START 

3 - Next Picture. 

Note 1 - that discarding one picture may immediately be un-done 
if that picture is a Held picture by the generation of a dummy field 
to preserve the alternating top/bottom field structure. As a result 
if drscard_until is set to "Next Picture" but the dummy field would 
be generated one further picture is discarded. 



Table 21. Time-stamp "MSM" registers 



Sup port for 30 Hz 

The present invention does not support a 30 Hz frame rate properly. However, it will be appreciated by one of ordinary 
skS in Z ar ttat the invention may decode 30 Hz data if the clock generation circuitry is modrf led appropriately. In th s 
~se ttesv^emlsS^ed with a 27.027 MHz clock so that the typical CCIR-601" raster produces pctures at precisely 
3?Hz*n oSerTo achate the 27.027 MHz clock, it must be divided by 300.3 to provtte ft. 90W. dock Since 
the present invention scales this value by a factor of sixteen, it is necessary to d.v.de the clock by 4804.8. 

Introduction 

This section details a Micro-codeable State Machine (MSM). in accordance with the present invention. The aim of build- 
ing the MSM was to produce a machine that with small amendments can be used in a number of appl.cat.ons such as 

Tn^MS^^efeS genera, purpose nature providing support to a wide range of feature, How- 

ever me underlying structure of the MSM is modular, allowing flexibility in building. Accord.ngly. those of ordinary skill 
in the art will aDDreciate that the present invention can be used with a variety of applications. 
M shSSit Rgu^66 the system design is segmented into two sections. Thefirst is a State Machine 218. Th.s gen- 
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s^sw^^icsxasrss: see r r wi,e w — - ~ 

comprising an ALU 222 and associated roister «te »T52 f •«! ^ ^ 6COnd SeCt,0n ,S an Emetic Core 219. 
line. I, accepts data and instruc^tSe^hfcon^o ^ ^STSS2?? 9 " T "S* "* ** 
conjo, - a .o-wire int.ace Br in g in 9 these two ^^=^0^^ 

State Machine 
Jumps and Calls 

ssass^ ^xCrr^T?^ a condition is — — ** ~* - 

to unconditional jump or nSp re JSJj ST^^'^S^ ^ " pro * ted 

on the Status bus. If the condi ion is no 1 "true" or "falsi "T u ^ ondrt,ons « total) are based on tests 

has executed the instruction and Z Z stetus^teck to the ^ T- "i"" ,h6 Arithme,ic Core 219 

conditions are shown belo in Table 22 aCh,ne for ,esting a9ainst ,ne condition. These 




1 H the State Machine is atso con.rol.ing upstream btocKs. ,hese .wo 2-wire interfaces may be combined. 
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Code 


Condition 




1110 


V 


Overflow 


1111 


NE 


Extn bit is low 



10 TABLE 22. State Machine conditions 



75 



If a jump is taken on an instruction with the call bit set. the next address, had the jump not been taken will be stored as 
the return address. Accordingly, this forms a mechanism for routine calling. To return from the routine to the stored 
address a call is made to address 0x001 . Calling is only supported to a depth of one call. ..e.. only one return address 
can be stored. Nevertheless, calling from calls, although erroneous, is not checked for in the hardware. 



20 



25 



Interrupts and Errors 



In the present invention, if the interrupt/error wire sampled high, an unconditional jump is made to the interrupt/error 
address (address 0x001). The next address that was to have been taken without the interrupt/error .s stored. To return 
from the interrupt/error routine, is a jump to the interrupt address (0x001) is performed. 

™?M Machine 21 8. in accordance with the present invention, is hardwired to execute as either an interrupt or error 
routine The difference is that interrupt routines mask out other interrupts while executing, whereas error routines do 
not. The State Machine 218 is currently wired as an interrupt rather than an error pin. 



Jump Addresses 

The address loaded into the Program Counter is the Jump address. The twelve bits of this address are contained in a 
so ucie rSeS U ^neither be an absolute address or It may have portions substituted into It from the output ol I the, ALU 
222 If an address is to be substituted, the State Machine 218 will wait until the Arithmetic Core 219 has executed the 
instruction and fed the ALU 222 output to the State Machine for the substitution. 

%?EZ cSteaddress. in accordance with the present invention, is shown in Table 23. "Jump Address iMMq. 
The Ms marked V indicate absolute address bits. The remaining address bits of lesser s,gnrf .cance vwll be substituted. 
35 The LSB marked "s" is the substitute bit. 
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10 



15 



20 



25 
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35 



No. Bits 
Replaced 



8 



10 



11 



12 



Load Return 
Addr. 



I B 


A 


9 


8 


7 


6 


5 


4 


3 


2 


1 


0 


s 


1 3 


a 


a 


a 


a 


a 


a 


a 


a 


a 


I 3 


a 


0 


1 3 


a 


a 


a 


a 


a 


a 


a 


a 


a 


a 


0 




1 3 


a 


a 


a 


a 


a 


a 


I 3 


a 


a 


Jo 






1 3 




a 


I a 


a 


a 


a 


a 




0 








1 3 


a 


a 


a 


I 3 


a 


a 


a 


0 








1 3 


a 


a 


a 


a 


a | 


a 


0 (i 1 1 






1 3 


a 


a 


a 


a 


a 


0 


1 1 








1 3 


a 


a 


a 


a 


o I 


1\ 


1 












a 


a 


a 


a 


0 


J I 




1 












a 


a 


a 


0 f 


1 






1 












a 


a 


0 


1 


1 






1 












I a 


0 


1 ' 


1 


1 






1 












0 


1 


1 


1 


1 






1 












1 


1 


1 


1 | 


1 






1 
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Table 23. Jump Address substitution 

The address substitution feature of the present invention allows the construction of jump tables. 
State Machine Internal Instructions 

SSn^f ^ JO Pert ° rm uf Peat6d conditional tests °" status bus. These instructions are internal to the State 
Machine 218 and requ.re stable feedback from the Arithmetic Core 219. Therefore, these type of instructions be 
marked as non-vahd for the Arithmetic Core 219. which will then fail to execute them. Accordingly TSSZTj^ 
vided to mark instructions as valid for the Arithmetic Core 219. P 

State Machine Testing 

^ P T* ni inVenti ° n ,0 6nable thG Slate Macnine ' s 218 operation to be verified, a number of registers will be acces 

«?J5i2? f™ ine is then ha,ted and H is 53,6 to acce - ^ e - Je - be 

When the microprocessor has access, it can read and write to the following registers: 

the program counter 
the call return address 
the interrupt return address 
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- the interrupt status bit (i.e., stating whether a interrupt is in progress) 
all bits of the ucode 

Table 24 describes the various addresses of these registers. 

The State Machine 2 1 8 can also stop itself by generating a microprocessor event. Only if the event s mask bit is set wiH 
the machine halt. Access should then be gained in the normal way when servicing this event. An event can be brought 
about by a call to the reset address (0x00). The call will not actually be taken, but simply generate the event after the 
instruction is executed. It will, nevertheless, remain at the output of the instruction ROM for inspection. 
The State Machine 21 8 of the present invention possess a mode in which it will single step through its instructions. Sin- 
gle stepping is initiated by setting bit 0 of the MSSR register. The machine will then stop before each instruction The 
stopped state is indicated DyT- Stopped. 7*e instruction about to be executed will then be at the output of the instruc- 
tion ROM and is able to be changed via microprocessor access. To restart the machine, write 1 to bit 1 of the MSSR 
register. Both of these bit registers are synchronized and. therefore, require microprocessor access before they can be 
accessed. 

State Machine Ucode Map 

Table 25 shows the microcode map for the State machine of the present invention. 
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Address 


Use 


0x000 


reset address 


0x001 


interrupt/error address 


0x002 


ucode program 
addresses 


-Oxfff 
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Table 25. State Machine Ucode Map 



State Machine ucode word 

40 Similarly. Table 26 depicts the State machine microcode word, in accordance with the present invention. 



45 




Table 26. State Machine Ucode Word 



where: 

a = address; 

s = substitute an address; 
c = call or jump; 
55 condition = jump condition code; and 

v = instruction valid for Arithmetic Core 
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w 



15 



20 



Arithmetic Core 



?™^?T ,nVen,i ? n ; AritHmetiC C ° re 219 PSrforms a " ,he da,a ^"iPu'ation within the MSM 218 As shown in 
s^cTe^ 

^ ™ 9d - in r i9UrS ?"u * - Arithmetic 219 of the invention has three main functional blocks the token oort 360 
£ R^r < S' n9 ^ h ? e u da,a Str6am; lhe ALU 222 (and DOSsib, y olher functi °"s) for execSng art 
* 1 Wh ' Ch ^° mainS a " ,he re9iStere - A " 1)0568 are ""baled in Figure 68. bSto btoS am 
se.ected from these buses. The size of these se.ectors and their inputs can vary and are under ucode ^ntrol 



ALU 



25 



The ALU block 222, in accordance with the present invention, is responsible for all the compulsions and number 
rnanrpu afons m the arithmetic core. It allows quite complicated computations (such as recircuSn mS3£EES 
STLh %r*TV"- 3 COmbina,ion 0f re,ative, y ^ operations (i.e. shifting. ^nS^^S^S 

met Core 2,T~ a ^T*^ " then provided aS to ™* I 

metic Core 2 1 9. as a whole, to perform the more complicated computations. 

Shift Block 





[shift function I 


00 


I' = I 


01 


I' = I; NOP 


10 


l' = (I « 1) + K 
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I ss I 


jshift function 


11 I 


|r = (i » 1) + (K << 32) ] 



Table 27. Shift Block 

&2££ZiZ£Zr' ALU 222 as a whote m ^ aN ° ^ ra,fo - « - *— ~ — — 

Carry Block 

HH^Z ^ S he L lf k l S ,hS bit fr ° m sta,us re 9 is,ers or c, ears it. In single word addition and subtraction ooer- 
afjons. the carry brt will be cleared, while in multiple word operations, the carry generated by the UtoZntaM 
stored in the status flags) will be used as the carry. This is depicted in Table 28 ^ ( 
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0 


C = 0 


1 


C = H from status flag 



10 



Table 28. Carry Block 
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( >ndition Block 

In accordance with the present invention, the block conditions, the Augend and the Carry to the ALU core functions are 
defined in Table 29. 
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ii 


invert function 


00 


J* = J 




C* = C 


01 


J' = ~J 




C* = -C 


10 


J' = J & L 




C = C & L 
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AO 





linvert function 


11 


J' = (L?J:~J) 
C = (L?C:-C) 
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Table 29. Condition Block 
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ALU Core 

The ALU core 222 of the present invention performs a simple set of logic and arithmetic functions using two's comple- 
ment arithmetic. These are defined in Table 30. 
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ff 



10 



ALU core functions 



0 j 


|r + j* + c 


Add 


1 I 


Ir A j 


XOR 


10 | 


Ir & J" 


AND 


11 I 


Ir | j' 


OR 



15 



Table 30. ALU Core 
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ALU ™ 2 ™ ^ S,3,US " a9S 9enera,ed - (See Tab,e 31 > ™ ese are »oth stored in the Reg- 
ister F.le 221 (as shown ,n Table 36) and are sent back to the State Machine 218 for comparison with condition codes. 
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I Meaning 


invert function 


Carry 


Carry Out from ALU operation 


Zero 


ALU result is zero 


Negative 


MSB of ALU result = 1 


Overflow 


ALU operation overflows 
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Table 31. Status Flags generated by the ALU core 
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ALU Ucode Wnrri 

Table 32 illustrates the ALU microcode word. 
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6 


I 5 


4 


3 


2 


1 




s 


s 


I 


I 


f 


f 


c 



Table 32. ALU microcode word 
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where 

ss is the shift block controls 
ii is the condition block controls 
ft is the ALU core controls 
c is the carry block controls 
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Use of the ALU 

Table 33 describes the bits patterns for the various functions of the ALU. in accordance with the present invention. 



Table 33 
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Bit number 


6 


5 


4 


3 


2 


1 


0 


Addition (l+J) 


0 


0 


0 


0 


0 


0 


0 


Subtraction (l-J) 


0 


0 


0 


1 


0 


0 


0 


Multiplication 


1 


0 


1 


0 


0 


0 


0 


Division 


1 


0 


1 


1 


0 


0 


0 



Register File 

Figure 69 illustrates the register file 221 of the present invention. The register file 221 contains 64 thirty-two -bit word 

registers The register file 221 can address partial words, i.e.. the file can be addressed as 64 x 32 b,t. 128x16brt 256 

x 8 bit 512 x 4 bit 1024 x 2 bit. or 2048 x 1 bit formats. The address is provided directly from the ucode or the address 

may have portions of it substituted into from special registers. This allows indexed access of the register. 

At each instruction a read-modify-write will be done on a single register. The read-modify-write facilitates the wntrng of 

partial words back into the file. The source of the write is determined by an external multiplexer with rts own independent 

ucode If no write is desired, the output of the register file 221 should be selected by the multiplexer. 

Partial words will be treated as signed or unsigned numbers dependent on bit 0 of the mode register. If the part.al word 

is negative (i.e.. it has its MSB set) it will be sign extended up the full width of the bus. This allows the easy use of partial 

words in arithmetic. ... 
Three locations in the register file 221 of the present invention are also connected to a dedicated bus. but they are still 
allowed to be used in parallel with other register file locations. These are the A and B registers and the status register 
shown in Figure 69. The register file also contains the index registers for address substitution with accompanying ter- 
minal count registers, constant registers and a mode register specifying modes of the register file. 

Register File Addressing 

The addressing, in accordance with the present invention, must cope with two different features: variable length 
addresses for accessing varying width portions of words, and address substitution. 

To address partial words requires a longer address. Therefore, all addresses are of variable length and they are 
encoded as follows: where "a" is an address bit. the least significant of the address bits is "S". the substitution bit. 
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TABLE 34. Variable width addressing 

The ^ssing is big end.an That is to say the higher, more sign* icant portions of the words are addressed with lower 



addresses. 

Portions of the addresses *a .a" can be substituted with 



one of the index registers. Using an address of an eight bit 

which are 
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Bits to be sub stituted | C 
0 



1 



8 



B A 9 8 



I 0 


0 


0 




a 


a 


a 


a 


a 


a 


a 


a 


a 


0 


0 


0 


0 




a 


a 


a 


a 


a 


a 


a 


a 


0 


1 


0 


0 


0 




a 


a 


a 


a 


a 


a 


a 


0 


1 


1 


0 


0 


0 




a 


a 


a 


a 


a 


a 


0 




1 


1 


0 


0 


0 




a 


a 


a 


a 


a 


0 


1 




1 


1 


0 


0 


0 




a 


a 


a 


a 


0 


1 


1 




1 


1 


0 


0 


0 




a 


a 


a 


0 


1 


1 


1 




1 


1 


0 


0 


0 




a 


a 


0 


1 


1 


1 


1 




1 


1 


0 


0 


0 




a 


0 


1 


1 


1 


1 


1 




1 


1 



TABLE 35. Address substitution 
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It can also be seen that wrth the above scheme, rt is possible to use illega, addresses, such as 0bOO0oS)O00000 or 
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Obi 11 1 1 1 1 1 1 1 1 1 1 1 ■ Illegal addresses will result in no address being accessed, leaving the output bus of the register 
file unknown. 

R oister Filfi Register Types 

In the present invention, there are a plurality of register f ile register types. Each is described as follows: 
• Independently bused registers 

Three registers (A. B. and Status register) have their own dedicated buses, as well as being accessible in the normal 
wav in the register file. This allows the registers to be connected to more places in the arithmetic core 219 and to be 
accessible in parallel to others in the register file. The independent buses can only access the registers in their full 

width, i.e., 32 bits wide. _ . .. . . „ 

There is no ucode write enable to these registers. Writing to them is only by way of an external multiplexer which has 
its own ucode control word. To prevent a write, they must be written to with their own value as shown in Figure 70. 
When the independent bus registers are written to as if in the register file, the independent bus write is suppressed. 
The Status register is implemented as an independent bus register. The bits of the register are defined in Table 36. 
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Bit I 




Meaning 


o I 


1 ' 


Index Reg 


1 I 


, E 


Extn 


2 I 


V 


Overflow 


3 


| N 


Negative 


4 I 


Z 


Zero 


5 


| c 


Carry 


6 




Gnd 


7 




Gnd 



Comment 

An index register increments passed its ter minal count 
Extension bit from input 
ALU operation overflows 
MSB of ALU result = 1 
ALU result is zero 
Carry from ALU operation 
Unused 
Unused 

TABLE 36. Definition of the Status register 
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• Index and terminal count registers 

Two eight bit Index registers are provided for substituting into addresses. One of these can be incremented per instruc- 
tion under the control of the ucode. Furthermore, each is accompanied by a terminal count register. When the register 
incremented is passed, its terminal count will be reset to zero. 

The index registers are called Y and Z which have terminal count registers U and V, respectively All of these can be 
accessed in the register file. 

Index register Z has a predefined decoder attached to its output (at present this decode is an inversion). Dependent on 
the lndex_Mode in the mode register (bit 1), this decoder rather than the index registers, will be used in address sub- 
stitution and read from Z in the register file. (Index Jvlode = 1 read decode, lndex_Mode = 0 read count) 

55 • Constant registers 

In the present invention, sixteen of the register file's 32 bit locations will be predefined constants. These may be read 
out as normal registers. Writing to these locations will have no effect. (The constants selected for the current embodi- 
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ment are 0-7. However, ft will be appreciated that other numbers of constants may be used ) 

This .mptementation of constants in accordance with the present invention, does away with the need for a constant fipW 
m the ucode and for a constant bus in the arithmetic core. It does, however limit the nSer J f £ 

S r el ( ^ nU ? er 16 ? " e9 ° tiab,e - ) ^ """I* are « a per instance SKS!^ vTrJ 

frequently used values could be connected to multiplexers, if necessary. "'ermore, very 
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Register Hie Address Ma p 

Table 37 shows the Register File address map for the present invention. 



15 



20 



25 



30 



1 32 Bit Location 1 


[Bits 


[.Register | 


0x00 


All 


A register 


0x01 


All 


B register 


0x02 


7:0 


Status register 


0x02 


8 


Sign Extend mode ■ 


0x02 


9 


Index Decode mode 


0x02 


31:10 


Normal register 


0x03 


7:0 


Y index register 


0x03 


15:8 


Z index register 



35 



40 



45 



32 Bit Location iBits Reqister 



0x03 


1 3 1 : 1 6 


Normal register 


0x04 I 


| 7 0 


U terminal count register 


0x04 | 


| 15:8 


V terminal count register 


0x04 I 


pi:16 


Normal register 


0x05-0x37 | 


| All 


Normal registers 


Ox37-Ox3F I 


! All 


Constants 



SO 



TABLE 37. Register File Address Map 



55 Register File Ucode Word 

Table 38 shows the Register File microcode word for the present invention 
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Bit 


d 


¥ 

c 


b 


a 


9 


8 


7 


6 


5 


4 


3 


2 


1 


0 




No. 






























5 


Bit use 


a 


a 


a 


a 


a 


a 


a 


a 


a 


a 


a 


s 


r 


! 
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where 



Table 38. Register File Ucode Word 
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a 
s 
r 
I 



whole register file address (always 12 bits) 
substitute bit 

index register to use for substitution; select Y, Z index registers if n = 0, 1 respectively 
increment index register specified by r 
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Token Port 

The Token Port of the present invention is the arithmetic core's connection to the data stream. It is a two-wire interface 
connection. 

The data at the Token Port input is only defined during a Token Port reading cycle. It should, therefore, be used during 
read cycles only. 

If the input port does not contain valid data during a read cycle or the output port is not accepting during a write cycle, 
the Arithmetic Core will stall. Accordingly, it will perform no operation, read no new ucode word, and write no registers. 
It will only restart when these conditions do not exist. 
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Token Port Ucode Word 

Table 39 depicts the Token Port microcode word. 



35 




1 


0 


I 


0 



40 



Table 39. Token Port Ucode Word 



where 



I = read into input port 
45 O = write from output port 



Multiplexers 

The selection of sources for blocks is done by the use of multiplexers. Almost all combinations of bus are allowed (with 
so the exception that the input to functional blocks, e.g.. the ALU must be from storage blocks, e.g. . Token Port or Register 
File). 

The multiplexers are either 2, 4 or 8 input. They, therefore, use 1 . 2. or 3 bits of ucode word, respectively, to control their 
selection of inputs. 



55 UPI Memory Map 

Table 40 shows the MSM address map. in accordance with the present invention. 
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10 



15 



20 



25 



30 



1 Address 


1 Bits 


I Location 


0x000 


jo 


[ MSM Event bit 


0x001 


1 0 


MSM Mask bit 


I 0x100 


I 7 


Access bit 


0x101 J 


1 0 


[MSSR Set single stepping I 


I 0x101 


1 


[ MSSR Monitor Single Stepping 


| 0x101 | 


I 2 


MSSR Interrupt status register (Read Only) I 


0x102 I 




Program Counter MSB | 


| 0x103 I 


| 7:0 | 


Program Counter LSB 


[ 0x104 | 


! 3:0 | 


Call Return Address MSB 


| 0x105 | 


7:0 | 


Call Return Address LSB 


0x106 I 


3:0 I 


Interrupt Return Address | 


0x107 1 


7:0 I 


Interrupt Return Address 1 




Table 40. MSM Address Map 
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Introduction 

In the MPEG coding standards (both EGMP-1 and MPEG-2) the quantized coefficients are coded as "events". Each 

i^?* IS T^iS. a . R li N ^ 3 LEVEL The RUN 18 thS nUmber 0f Zero <*> efficle "* that precede a given non-zero coef- 
ficient. The LEVEL is he value of that coefficient. In addition, one special event. End-of-block. is used after the last non- 
zero coefficient to indicate that the remainder of the block is all-zeros. 
For example, assume the following sequence of coefficients: 

1. -7, 0. 3. 0. 0. 0. -1. 0. 0. 0. 0. ...0 (total of 64 coefficients) 
These would be modeled by the following events represented as (RUN LEVEL)* 

(0, 1) (0.-7) (1.3) (3.-1) (EOB) 
It is the task of the inverse modeler to reverse the modeling process such that each of the 64 coefficients is represented 
as a simple number for subsequent processing. 
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Interfaces 

The following signal pins are used to transfer data into the inverse modeler of the present invention: 



• level [1 1 :0] 
run[5:0] 
in_extn 
ln__valid 
ln_accept 

Tokens are transferred on the levelp 1 :0] bus (in the lower order eight bits; level[7:0]). 

run[5:0] serves as an auxiliary bus to carry the RUN information. It has no meaning except in the data words of a DATA 
token. 

The following signals are used at the output of the inverse modeler: 



out_data[1 1 :0] 
out__extn 
out_valid 
out_accept 

Functional Description 

Data in DATA tokens is expanded so that there is always 64 coefficients in the DATA token presented at the output of 
the inverse modeller. In most cases, the last data word of the DATA token will not cause the 64 th coefficient to be gen- 
erated This is not an error, it is just that at this point the EOB event would have been coded in the bitstream. Therefore, 
in this situation the inverse modeller must continue to output zero data token words until a total of 64 coefficients have 
been produced at the output. 

In certain circumstances (e.g.. when a data error occurs) it is possible for the DATA token at the input to the inverse 
modeller to represent more than 64 coefficients. In this situation, the modeller must discard all the extra data and pro- 
duce a token at its output that contains just 64 coefficients. 

All non-DATA tokens that appear at the input are simply transferred, unmodifed, to the output of the inverse modeller. 



Timing Requirements 

It is a requirement of the present invention that data flow through the inverse modeller at the clock rate. 

In the situation where there are no gaps at the input to the Imodel and the circuity connected to the output does not 

cause the Imodel to stall (i.e., in_valid = 1 , out_accept =1 ) then a new data word will appear at the output of the Imodel 

every clock cycle. Note, however, that in this situation, the Imodel may not accept new data at its input on every single 

clock cycle because a non-zero RUN (in a DATA token) will cause more than one data word to be produced for each 

input. 



Microprocessor Interface Access 

The inverse modeller circuitry of the present invention is not required to be connected to the MPI in its normal mode of 
operation. Note that the error condition (too many coefficients) should not produce a microprocessor interrupt, ft is sim- 
ply dealt with internally by discarding the extra data. 

However, microprocessor access will be required for the snooper (test) circuitry at the input of the block. 



Introduction 

In the MPEG coding standards, the coefficients are "zig-zag" scanned so that the lower frequency coefficients are trans- 
mitted before the higher frequency coeff icients. 

It is the function of the inverse zig-zag, in accordance with the present invention, to convert the one-dimensional stream 
of coefficients it receives from the inverse modeller into a two<Jimensional array of coefficients that can be processed 
by the IDCT 

In MPEG-1, only one scan path was used, this was literally a zig-zag (hence, the name). MPEG-2, however, uses two 
scan paths. The first is the original MPEG-1 path, the second is optimized for use in interlaced coding where there tends 
to be unusually large vertical frequency components. 
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in addition to the coefficients which are obviously transmitted in zig-zag scan order, the quantization matrices are down- 
loaded in zig-zag scan order as well. This occurs in MPEG-1, H.261 and JPEG. As a result, the present invention has 
its quantizer before the inverse zig-zag (which was implemented as part of the IDCT). The quantizer therefore oper- 
ates on a one-dimensional stream of coefficients which arrive in the same order as the downloaded quantization 'matrix 
coefficients. Hence, the quantizer simply has to associate the first coefficient with the first matrix element the second 
coefficient with the second matrix element, and so forth. 

However, since there are now two scan paths in MPEG-2. a new approach was taken on the present invention in which 
the inverse zig-zag precedes the inverse quantizer. Both the coefficients and the downloaded matrices are inverse 
scanned and the inverse quantizer now operates on the two dimensional data. It should be noted that this is only pos- 
sible because in all three representations of the data (two zig-zag scans and the raster-scan order at the output of the 
IZZ) the first coefficient is always first and the last coefficient is always last. The first coefficient is specially treated in 
the Iquant because it is the DC term. The last coefficient is specially treated because it may need to be modified as a 
result of mis-match control as a function of the values of all the other coefficients (so it must be last!). The 62 remaining 
coefficients are all handled in the same manner (excepting that each has its own quantization matrix element). 

Interfaces 

The following signals are used at the input of the inverse zig-zag of the invention: 

in_data[11:0] 

• in_extn 
in-valid 
in-accept 

The following signals are used at the output of the inverse zig-zag: 

out_data[11:0] 
out_extn 
in_valid 
out_accept 

Functional Description 

The IZZ responds to the following tokens. 

• PICTURE_START 

• ALTERNATE_SCAN 

• DATA 

• Q UANT_TA BL E 

All other tokens are passed, unmodified, through the IZZ. 

The PICTURE_START token causes the IZZ to reset its internal state which represents which of the two scan paths is 

in force (e.g., alternate_scan) to zero (indicating the MPEG-1 scan). 

ALTER NATE_SC AN is a token which can be allocated the value 0xe6 with a mask Oxfe 

The ALTERNATE_SCAN token is shown in Table 41. 




Table 41. Alternate_Scan Token 



y is the indication of which scan to use for subsequent DATA tokens and is. therefore, loaded into the IZZ reaister 
alternate_scan n . 

DATA tokens are re-ordered according to scan path zero (the MPEG-1 scan path) irrespective of the setting of 
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alternate_scan Note that alter nate__scan must retain whatever value it had (i.e.. must not be set to zero) so that sub- 
sequent DATA tokens are correctly handled. QUANTJABLE tokens are re-ordered according to scan path zero (the 
MPEG-1 scan path) irrespective of the setting of alternate_scan. Note that alternate_scan must retain whatever value 
it had (i.e., it must not be set to zero) so that subsequent DATA tokens are correctly handled. 

5 

Mal-formed Tokens 

Both the DATA and QUANTTABLE tokens may be malformed. Clearly, the DATA token should be correct since the 
Imodel should have ensured that it is correctly formed. However, no such assurance is available for QUANT__TABLE. 
w Since handling the malformed QUANTTABLE tokens must be implemented, it should also be implemented for DATA 
tokens as well 

in accordance with the present invention, DATA and QUANTTABLE tokens are too short when they appear at the 
input to the IZZ should result in a token at the output with the correct number (64) of data words. The data contained in 
those words is unimportant and will probably be whatever junk happened to be in the reordering RAM before the start 
is of the token. Similarly, DATA and QUANTTABLE tokens that are too long should also result in correctly formed tokens 
at the output. The first 64 coefficients (matrix elements) should be used, the remainder should be discarded. 
Following a mal-formed token, all subsequent (correctly formed) tokens should be handled properly. 
There is no requirement for a microprocessor interface error (interrupt) to be generated. 

20 Raster Scan Order 

At the output of the IZZ, the DATA and QUANTTABLE tokens of the present invention represent two<iimensional data. 
However, the coefficients are still actually transferred as a one-dimensional series of numbers. The question arises 
whether the data should be transferred as rows or as columns. 
25 The prediction circuitry will require the pel-domain data to be organized in raster-scan order. Since the IDCT transposes 
the data it follows that the data going into the IDCT must be the other way around. Table 42 illustrates the order of the 
coefficients transferred at the output of the IZZ for DATA and QUANT.TABLE tokens. 
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Table 42. IZZ Output Coefficients 
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Microprocessor Interface Access 

There is no requirement for microprocessor access in the normal functioning of the IZZ. However, access will probably 
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be required so that the reordering RAM can be tested. It is also expected that there will be no requirement for a snooper. 
The one at the start of the Imodel is sufficient for both blocks. 

Introduction 

5 

This section deal with predictions. In this introductory section, all possible prediction modes are enumerated and dia- 
grams are provided for each one to explain exactly what must be dona 

Throughout this section no special attention is given to operations, such as half-pel filtering, that occur in the horizontal 
dimension. This is because these operations are the same as those on Brolly In the vertical dimension, however, things 
io are very different because of the interlaced picture format. 

Prediction in Frame Pictures 

Fr ame-ba s ed Pre di ctio n 

15 

In this mode, a prediction is formed from a reference frame. The result is as if the two reference fields were first com- 
bined into a frame and then a prediction were made from that frame. Note that this is precisely the situation as 
described in Brolly. 

Half-pel filtering may be made in the vertical direction and this is triggered by the least significant bit of the vector. In 
20 addition to the least significant bit, the next most significant bit (bit 1) has special significance since this will determine 
whether the top line of the prediction comes from the top reference field or the bottom reference field. 
Thus, four cases have to be considered, each dependant upon the binary value of the least significant two bits of the 
vertical vector 

25 vector [1] = 0, vector [0] = 0 

As shown in Figure 71 . just 1 6 lines (8 for the chroma) are read (since there is no half pel filtering. 8(4) lines from each 
reference field. 

30 vector[1J = 0, vector[0] = 1 

Likewise, as shown in Figure 72, 17(9) lines are read, 9(5) lines are read from the top reference field, 8(4) lines from 
the bottom reference field. 

35 vector [1] = 1 , vector[0] = 0 

Again, as shown in Figure 73, just 16(8) lines are read but note that now the top line of the prediction has been read 
from the bottom reference field. 

40 vector [1] = 1, vector[0] = 1 

And, Figure 74 shows 17(9) lines are read, 8(4) lines are read from the top reference field, 9(5) lines from the bottom 
reference field. 

Accordingly, bit 1 indicates which reference field holds the top-most line that must be read to produce the prediction. In 
45 addition, if bit 0 is also set, it indicates which reference field has the extra line to enable half-pel filtering to be performed. 
It is clear that half-pel prediction cannot be performed until both fields have been read from DRAM. 
Great care must also be taken when scaling vertical motion vectors to obtain offsets in the field store. The following 
table, Table 43, illustrates the effect: 
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1 " . * 




Offset in field 


Vector 


Bit pattern 


top field 


bottom field 


-2 


...11100 


...11110 (-2) 


...1111 (-2) 


-1.5 


...11101 


...11111 (-1) 


...11110 (-2) 


-1 


...11110 


...1111 (-1) 


...11111 (-1) 


-0.5 


...11111 


...00000 (0( 


...11111 (-1) 


0 


...00000 


...00000(0) 


...00000 (0) 


0.5 


...00001 


...00001 (1) 


...00000 (0) 


1 


...00010 


...00001 (1) 


...00001 (1) 


1.5 


...00011 


...00010(2) 


...00001 (1) 


2 


...00100 


...00010(2) 


...00010(2) 



Table 43 



Field-based Prediction (in a frame Picture! 

In this mode, each field is treated independently. A separate vector is used for each of the two fields. Associated with 
each vector is an additional single bit flag (motion_vertlcaLf ield_select) that indicates whether prediction should be 
made from the top reference field or the bottom reference field. The bottom bit of the vector still indicates the need for 
half-pel filtering, but bit 1 has no special significance. Note that a field vector measures different units of a frame vector; 
a field vector with the value n represents the same actual displacement (on the glass) as a frame vector with the value 
2n. 

This time, however, there are sixteen cases to consider (since there are four binary variables; 
motion_vertical_f ield_select for each of the two vectors and bit 0 for each of the two vectors). There are too many 
cases to draw, hence, the following figures only deal with the prediction of the top field. The bottom field is obtained in 
an analogous manner. 

As depicted in Figure 75, motlon_verticaLf ield_select = 0, vector[0] = 0 

8(4) lines are read from the top reference field to form the top field of the prediction. 

Figure 76 shows motion_vertical_f leld_select = 0. vector\0] = 1 

9(5) lines are read from the top reference field which are then half-pel filtered to form the top field of the prediction. 

Likewise, Figure 77 depicts motion_verticaLfield_select = 1, vector [0] = 0 

8(4) lines are read from the bottom reference field to form the top field of the prediction. 

And. Figure 78 illustrates motion_verticaLf ield_select = 1 . vector[0] = 1 9(5) lines are read from the bottom reference 
field which are then half-pel filtered to form the top field of the prediction. 

Dual Primfi (in frame pictures) 

Dual prime is a special case of the Field-based prediction of the previous section. Essentially, dual-prime combines two 
features: 

• A special method of coding the vectors so that despite the fact that four independent field predictions are formed 
(independent in the sense that they each have a distinct vector) effectively only one motion vector is transmitted. 
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Thus, the vector overhead is dramatically reduced. 

For each field, the precfiction information is read from each of the reference fields. This is then averaged to form the 
final prediction. This is very similar to the B-picture case when a separate forward and backward prediction is made 
and then averaged. 

5 

In the present invention, the vector decoding will all be performed in the parser. Accordingly, when the prediction cir- 
cuitry receives data, there really will be four separate vectors. 

The dual-prime averaging will be performed by re-using the B-f rame averaging circuitry (dual-prime cannot itself be 
used in a B-frame). Hence, the only associated complication for the prediction circuitry is involved in the signaling that 
10 indicates that the backwards predictions (using backwards vector tokens etc.) should be performed from the forward 
reference fields (as opposed to the backward reference fields). Since a P-picture should never normally request a back- 
ward prediction, the prediction circuity merely needs to keep a record of the picture type (P or B) in order to be able to 
decide which reference store to use for a "backward" prediction. 

15 Prediction in Field Pictures 

Field-based Prediction 

This is very similar to field-based prediction in frame pictures. There are four cases depending on 
20 motion_vertical_f ield_select and the least significant bit of the motion vector. Note that it is not really relevant to dis- 
cuss top-fields and bottom-fields in the prediction that is formed, since the prediction is simply for the picture being 

decoded (which is either all top-field or all bottom-field). 

Figure 79 illustrates motion_vertical_f ield_select = 0, vector[0] - 0 

1 6(8) lines are read from the top reference field to form the prediction. 
25 Figure 80 shows motion_verticaLf ield__select = 0, vector{0] = 1 

1 7(9) lines are read from the top reference field and half-pel filtered to give the prediction. 

Figure 81 depicts motion_vertical_f ield_select = 1, vector[0) = 0 

16(8) lines are read from the bottom reference field to form the prediction. 

Figure 82 shows motion/vertical_field_se1ect = 0, vector[0] = 1 
30 1 7(9) lines are read from the bottom reference field and half-pel filtered to give the prediction. 

16x8MC 

In this mode, the macroblock is divided into two 16x8 regions, one above the other.: For. eachregion, a separate field 
35 vector is transmitted. Again, there are sixteen cases to consider (since there are four binary variables; 
motion_vertical_field_select for each of the two vectors and bit 0 for each of the two vectors): Again, these are too 
many cases to illustrate so the following figures need only deal with the upper 1 6x8 region. The lower region is obtained 
in an analogous manner. 

Figure 83 shows motion_vertical_f ield_select = 0, vector\0} = 0 
40 8(4) lines are read from the top reference field to form the prediction of the upper 16x8 region. 
Figure 84 depicts motlon_vertlcal_f leld_select = 0, vector[0] = 1 

9(5) lines are read from the top reference field and half-pel filtered to form the prediction of the upper 1 6x8 region. 
Figure 85 illustrates motion_verticaLf ield_select = 1 , vector\0] = 0 

8(4) Lines are read from the bottom reference field to form the prediction of the upper 16x8 region. 
45 Figure 86 depicts motion__vertical_fie1d_select = 1, vector[Q] = 1 

9(5) lines are read from the bottom reference field and half-pel filtered to form the prediction of the upper 16x8 region. 

Dual Prime in Field Pictures 

so Dual prime in a field picture is simply a special case of field prediction in a field picture. Two field vectors will be used 
(one will refer to the top reference field, on to the bottom reference field and the Parser will ensure this). One of the pre- 
dictions will appear to be making a backwards prediction, but because this is a P-picture, the prediction circuitry will 
interpret this as a second forward prediction. The two resulting predictions will then be averaged using the same cir- 
cuitry as that used for B-frame averaging. 

55 

Overall organization 

Figure 87 shows the overall organization of the display pipeline, in accordance with the present invention. Data anives 
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from the DRAM interface on a single multiplexed interface. Moreover, the DRAM interface will supply data in lines that 
are rounded up to the next 32 byte boundary above the correct number of bytes. However, the pels toward the end of 
the line that may lie outside the intended display area. 

In addition to the data the DRAM interface will supply one bit for each channel (Y, Cr and Cb) that indicates whether 
the byte is the last in the current display line. A further bit is supplied that indicates which field the data comes from. 
The first block in the display pipeline of the present invention splits apart the three channels. Chrominance (Cr and Cb) 
data is supplied to the vertical upsamplers 210. Luminance (Y) data can be delayed in a FIFO if desired. 
The vertical upsamplers 210 have the task of upsampling the chrominance data by a 2:1 factor so that there are as 
many lines of chrominance data as there are of luminance data. In order to do this the vertical upsamplers store each 
line of chrominance data and produce output pels that are interpolated between this line and the subsequent line. 
The next stage in the display pipeline is labeled "Horizontal Align 370". This is implemented as part of the horizontal 
upsampler 212 Its task is to align the data so that at the start of each line, the first pel of each of the three channels is 
supplied to the horizontal upsampler 212 correctly. At the end of each line, it is expected that, in general, the channels 
will "run out of data" at different times. The "Horizontal Align" block 370 has the job of discarding this extra data from 
the channels that have too much data while stalling the other channels so that they wait until all three channels are 
aligned and ready to commence the next display line. 

In the invention the hoc.zontai upsamplers 212 upsample the data horizontally to stretch the data to fill the glass of a 
TV screen In order to save silicon area, the filter is shared between the three channels. This can be done because the 
total output rate of the f iter must be 27 Mbytes/s (the clock rate). The data is multiplexed in the CCIR 601 order so that 
the data stream produced is simply multiplexed into the final data stream. 

Note that the horizontal upsamplers 212 merely take the amount of data supplied by the DRAM interface and scale it 
by a selected factor. In general, they will produce too little or too much data for the actual line length in the raster. This 
is handled in the output multiplex. 

Also note that the "Horizontal Align* block 370 does not need to know how many pels of each channel will be required 
to complete the line It is very difficult to calculate this number because the relation between the number of input pels 
to output pels for the upsampling filter is not very simple. The Horizontal Align block 370 simply supplies data to the hor- 
izontal upsampler 212 on each of its three channels "on demand," i.e., the horizontal upsampler "pulls" the required 
number of pels into it in the required order. At the end of the display line, one of the channels will run out of data first 
and this indicates that the remaining data for the other channels (if any) should be discarded. 

The VTG 333 simply counts through the raster and produces a series of timing signals that are supplied to the output 
multiplex 371 . Some of these signals are internal signals which tell the output multiplex 371 how to build the final raster. 
Other signals are "external" signals, such as sync and banking, and these are also supplied to the output multiplex 371 
circuitry so that they are delayed by the same number of clock cycles as the data. 

The output multiplex 371 block has several tasks. The most interesting of these is probably the task of removing the two 
wire -interfaceness" from the data. Data supplied from the horizontal upsampler 212 still has an associated valid signal 
(and the output multiplex provides an accept signal). Data at the output of the multiplex has no two-wire interface, it is 
simply clocked out, one byte per clock cycle. 

The output multiplex 371 also has the job of painting a border around the picture. The top and left borders are painted 
under the control of the VTG 333. The VTG 333 simply tells the output multiplex 371 to produce the requisite number 
of pels of border color. At the right and bottom of the picture, the output multiplex 371 paints its own border. i.e..it knows 
to do this because it runs out of picture data. ..,«.« ~. 

The final block in the display pipeline is the 8-bit to 1 6-bit output mode converter 372. This is quite simply a flip-flop and 
a multiplexer. It is intended that this be implemented at the output PAD itself. By doing this, it is possible to simply route 
an 8-bit bus, rather than a 1 6-bit bus. Each bit will go to two output pads. 

Horizontal Upsampler 
Introduction 

In accordance with the invention, the Horizontal Upsampler 212 performs the task of upsampling or interpolating the 

decoded picture in order to stretch it to f it the display raster. 

The upsampler 21 2 of the present invention can operate in four modes: 

1) 1:1 -Output is the same as the input 

2) 2:1 

3) 3:2 

4) 4:3 
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After some picture simulations and consideration of likely implementation costs, it has been decided to use a three tap 
filter to perform the interpolation. 

The filter is a "polyphase" filter in the sense that each successive output is generated using a different set of filter coef- 
ficients. The number of phases is always equal to the numerator of the upsampling ratio. Thus, the 4:3 upsampler has 
four phases, every fourth output sample being generated using the same filter coefficients. 

Since the upsampler 21 2 is generating more output data than it accepts as input data, it is clear that a new input sample 
is not accepted on every clock cycle. In fact, the number of phases on which the filter does not accept new input is the 
difference between the numerator and the denominator of the upsampling ratio. In each of the ratios (except 1 :1) this is 
one. Therefore, for each complete cycle around the phases, on one of the phases no new input data is accepted. In this 
case the data is the same as for the previous phase. The filter coefficients are, however, different to the previous phase. 

4:3 Upsampling 

In 4:3 Upsampling, the filter coefficients are shown in Table 44 while Figure 88 shows the filter in operation. The output 
pels are essentially formed as weighted averages of the input pels. 



Phase 


C[0] 


cm 


C[2] 


0 


0 


356 


0 


1 


42 


220 


-6 


2 


128 


128 


0 


3 


-6 


220 


42 



Table 44. 4:3 Filter Coefficients 



Note that no new input data is accepted before the final phase (phase 3) is calculated. 
3:2 Upsamp ling 

Table 45 illustrates 3:2 upsampling, while Figure 89 illustrates filter operation. 



Phase 


C[0] 


C[1] 


C[2] 


0 


0 


256 


0 


1 


68 


194 


-6 


2 


-6 


194 


68 



Table 45. 3:2 Filter Coefficients 



2:1 Upsampling 

Likewise, Table 46 illustrates 2:1 upsampling and Figure 90 shows filtering thereof. 
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Phase 


C[01 


cm 


C[2] 


0 


0 


256 


0 


1 


0 


128 


128 



Table 46 2:1 Filter Coefficients 



Note that Phase 1 could equally well have been described as having filter coefficients. 128. 128. 0. This has the advan- 
tage that the filter coefficients would then be the same as for Phase 2 of the 4:3 upsampler. However, it has the dead- 
vantage that the rule "no new input is accepted when computing the last phase" would not be true. 

Boundary Effects 

At the edge of the picture, it is necessary to produce output pels that are formed from pels that lie outside the picture 
area. In order to avoid this problem, it is necessary to pixel-repeat edge pels so that the filter may proceed without real- 
izing that it is at the edge of the picture. 

In the case of a three tap filter, as in the present invention, it is necessary to repeat just one pel at the left of the .mage 
and one at the right. (A five tap filter would have required 2 at he left. 2 at the right). This is shown in F.gure 91 
Conceptually, therefore, the implementation could be viewed as being formed of two boxes: 

Note the scheme doesnt really work properly in the case that the picture is not a multiple of 16 pels wide because the 
DRAM interface will always supply data that is a multiple of 16 pels wide. 

However although this problem is known, we are not going to do anything about it. Most picture are multiples of 16 pels 
wide anyway, and in any case it is only the very last pel of the line that can be affected by the boundary effect. Th.s is 
illustrated in Figure 92. 

The Number of Output Pels 

In the present invention, the upsampler will produce a defined number of output pels for a given number of input pels^ 
This is important because this allows the parser State Machine to decide how many pels will be produced at the output 
of the upsampler and. hence, how many pels need to be cropped (or border pels added) in order for the picture to fit 

ThVfistvSidoutput from the horizontal upsampler should occur in response to the third pel being input to the upsam- 
pler (since this is a three tap filter). Since one pel is repeated, this will occur when the second actual pel is input to the 

The^aTt valid output should occur when all of the possible output samples have been produced in response to the last 
(i e repeated) pel being input. Since the last phase of the poly-phase f ilters is computed using the same input data as 
the second-last phase, it is possible that either one or two output pels are produced as a result of this last repeated pel 
entering the upsampler. 

If this is done, the upsampler will produce "q" output samples: 

q = N(pDIVM) + (pREMM) EC > 1 - 

in response to "p" input samples for an N:M upsampler. 
For example, for a 43 upsampler. Table 47 could be drawn up as follows: 
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p 

(inDut Dels) 


q 

(outDut Dels) 


1 
i 


1 




2 


3 


4 


4 


5 


5 


• 

6 


6 


8 



Table 47. Number of Output Pels for 4:3 Upsampler 



Position Signals 

Two signals are transferred along with the video data in the present invention. They allow the output multiplex to ensure 
that the data is painted into the appropriate position in the output raster. These are: 

lastjnjine 
fieldjd 

lastjnjine is active for one pel time and signals that the associated pel is the last pel in a scan line. 

field Jd indicates which field the data belongs to. "0" indicates the spatially upper field. "1" indicates the spatially lower 

field. Note that this designation applies before any border lines and the like, are applied to the decoded image, f ieldjd 

changes state one pel too early, i.e., between the second to last and the last pel of the field. This allows the last pel of 

the field to be identified without waiting for the first pel of the next field. However, there may be no "next field" if decoding 

stops for some reason. The f ieldjd signal is shown in Figure 93. 

If a true field indicator is required, it can be obtained by delaying f ieldjd by one pel time. 

Since these signals work their way along side the data through the entire display pipeline, it is important to use two sig- 
nals, not three (which would allow a last pel in field signal) because it saves many flip-flops. 

Multiplexed data 

When position signals are applied to multiplexed data, care needs to be taken. 
The data is multiplexed in the order; C bY C rY . 

In the present invention, the three samples (C^ C r ) are co-incident in time and should, therefore, be viewed as indivis- 
ible. The remaining byte ( Y ) is positioned between the preceding (Cb Y C r ) pel and the subsequent (Ct, Y C r ) pel. 
As a result, the last byte in the line will either be the C r or Y . (Note that upsampling by 3:2 may produce an odd number 
of Y pels.) H the last byte in the lines is C n then, there should be a discontinuity in the multiplex signal because the first 
byte of a line is always C^: 

Horizontal Alignment 

At the input of the upsampler, there is no guarantee that the three different channels will line up. 

In order to achieve alignment, in the present invention, a "protocol" between the horizontal upsampler and the horizontal 

alignment blocks needs to be agreed. In accordance with the present invention, the protocol performs as follows: 
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• The horizontal block supplies pels, on demand, to the horizontal upsampler. When it runs out of data for a given 
channel, it will signal this to the filter using a signal marking the last pel of the line. This will only happen for the 
repeated pel. 

. The horizontal upsampler ensures that once it has been supplied the last pel from a given channel it will not ask for 
another pel from that channel in the current line. However, the filter continues to operate, taking any necessary pels 
from other channels, until just before it will demand a pel from the channel that it knows has run out of data. The 
filter marks the last pel it can produce at the output as the last in the line. At this point, it resets itself as ready for 
the next line of data. 

. When the horizontal upsampler sees the filter accept data for a channel that has already been exhausted, it knows 
that the filter is asking for the first pel of the next line. At this point, any remaining pels on the other two channels 
are discarded. The next pel that will be supplied on each of these channels will be the first pel of the line. 

Although it is convenient to think of two separate blocks (the horizontal alignment block and the horizontal upsampler 
filter) it is likely that the two will be implemented together, in order to explain the operation. 

Upsampling Ratio 

The upsampling ratio will be supplied to the filter as a two bit binary number. In order that the filter operates in a sensible 
manner, the upsampling ratio should be sampled, by the upsampler itself, once per field time. The circuitry supplying 
the ratio is then free to update the sampling ratio, in readiness for the next field, at any time during the current field. 
The ratio should be sampled as the first pel of each field is actually accepted (rather than just after the last pel of the 
previous field). In this way, the very first field after reset (or after some pause in decoding) is upsampled with the correct 
ratio. 

Video Timing Generator 
Introduction 

This section describes the video timing generator circuit (VTG 333) in accordance with the present invention. The VTG 
is primarily responsible for generating the various analogue video synchronizing signals, and also for maintaining 
knowledge of the display system's current raster position. This enables the VTG to provide controlling signals for the 
output multiplexer which selects between active video, border and blanking sources for output. Both analogue and dig- 
ital standards are supported, with two frame sizes (PAL and NTSC). and associated synchronizing behavior, selectable 
at setup. Border or cropping width will be specified in a token which will load a hardwired input to the VTG. 

Horizontal Timing 

The horizontal timing parameters are illustrated in Figure 94. These are split into those that are fixed (for either PAL or 
NTSC) and those that are variable (i.e., the parameters associated with any borders or cropping that may be specified. 
The interlaced nature of the video being displayed imposes a requirement for half-line based counting, so than various 
timing points are shown separately for each half of the line. 

A line comprises an initial banking period, the insertion of a SAV token, an active period, the insertion of an EAV token, 
and the trailing blanking period. During blanking lines, the active area will have blank values inserted rather than border 
and data. 

A line sync pulse appears at the beginning of every line (HSYNC). On certain blanking lines, two sync pulses appear, 
one at the beginning and the other after the first half line. The width of these is dependent on which vertical region is 
active: equalization or serration (field sync). 

During the initial horizontal blanking period, pels are discarded according to the cropping value (if the crop bit is set) - 
a fixed period of 120 cycles is allowed to discard the RHS cropped pels from the preceding line. The LHS pels for the 
current line are then discarded, and pels are stalled until the start of the active region. It is essential that there are no 
gaps in the data stream from which pels are being discarded, otherwise distortions will occur. 

If the crop bit is not set. however, a border is constructed by inserting border value for a period of borderL: followed by 
data for picture width, and then border again until the end of the active region. Note that it is not necessary to calculate 
the borderR value. 

The total horizontal border or crop width is specified in pels. The LHS border/crop value must be a multiple of 2 pels in 
order for the sampling to remain consistent. Consequently, it must be a multiple of 4 in terms of clock cycles. This can 
be achieved by masking out the least significant 2 bits from the original total border value in pels. For example, if the 
specified border is 91 pels, the left border will be 88 cycles long, and the picture width will be (720 - 91)*2 cycles. 
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Streams of pels arrrv.ng at the outpul max are padded to give blocks of 32 pels. Considering this, together with the scal- 
ing factors to be supported, the maximum number of pels to be received for a line will be 832. This means that the max- 
imum crop value will be 1 12 pels, giving 1 12 cycles of cropping at the LHS and the RHS 



Vertical Timing - PAL 

The vertical timing parameters for PAL in accordance with the present invention, are illustrated in Figure 95 Two fields 
are shown separately, as they have slightly different timing. Analogue parameters are indicated by the shaded regions 
being identical for each field, and digital parameters are shown by the waveforms. For simplicity, the zero-borde7case 
is shown If a non-zero vertical border is specified, border is inserted for a period of borderT. then data for picture height 
hen border aga.n until the end of the active region (fixed). BorderT and picture height are calculated in a way analogous 

° , ?! J 1 P,CtUre Width (i " horizontal « min 9) respectively. Once again, the initial border (borderT) must be a mul- 
tiple of 4, this time in terms of half lines because the top border must be a multiple of 2 

Note that MPEG codes 576 lines of video for PAL. whereas the analogue standard specifies only 525. This difference 
.s accommodated by selecting data for output for 576 hatf-lines per field, but only asserting the analogue banking signal 
for the requisite 575 lines. 

Vertical Timing - NTSC 

HSJ «?i vertica, I t L min9 ' in accordance with the P rese * invention, is illustrated in Figure 96. It is similar in principle 
to the PAL timing, although slightly more complex. MPEG codes 480 lines of video for NTSC. whereas the analogue 
standard specif ,es 483. This means that 3 lines of border must be inserted per frame to fill the gap (3 half lines per field) 
n addition the judicial vertical banking indicator, V, is specified in such a way that additional border lines are required 
to be inserted as padding before the active video lines. Non-zero vertical borders will be inserted in addition tothose 
either ^tandarT in the Previ ° US SeCtl0n ' Furth ermore. note that vertical cropping is not allowed in 

There is. at present, some uncertainty about the digital blank signal. V, since various reference sources give conflicting 
MSB re Jertive^ ^ ^ ""^ P 055 ™* 65, i,,ustrated bv v ^ V\ with the associated border select signals SB 

VTG Structure 

The video timing generator of the present invention comprises separate machines for the horizontal and vertical timing 
doma.ns.The vertical machine provides control signals for the horizontal machine, which, in turn, provides the half-line 
increment signal for the vertical counters. 
Inputs to the VTG are: 

clocks and reset 
• PAL not NTSC 

horizontal border value with crop indicator 
vertical border value 

Outputs are: 

horizontal, vertical and composite sync and blanking signals 
select signals for data, border, banking 
a discard data bit for cropping 
insert SAV and EAV 

F and V values for construction of SAV and EAV 
a 2-bit YUV position counter for SAV/EAV insertion 
a firstline bit to indicate the start of a picture at startup 

A '!k f * he ° utputs 9° t0 the ***** multiplexer block, including the sync signals which can then remain in synchronization 
witn the data. 

Horizontal Machine 

The horizontal machine is essentially a counter with hardware to delect the arrival of the various timing points as shown 
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in Figure 94 The count goes from zero to half line length (which is different for PAL and NTSC) and is repeated for each 
half line A hardwired comparator exists for each of the fixed timing points, these being activated according to the stand- 
ard In addition there is a register for the border value (which is polled once per field), a subtractor to determine the 
picture width and an auxiliary counter for counting down from the border value to zero. This procedure occurs in parallel 
5 with the main half-line coining. The datapath is 10 bits wide, and 1 5 hardwired comparators are required to implement 
both PAL and NTSC. The structure of this current embodiment is shown in Figure 97, together with approximate sizes. 
The datapath is estimated to be 360u x 330u. 

In addition to the datapath, most of the control logic in the VTG of the present invention will be associated with the hor- 
izontal machine. This will probably amount to 100-200 gates, 
yo Inputs to the horizontal machine are: 



clocks and reset 

horizontal border value and crop bit 
line, equalization or field sync indicators 
75 • PAL not NTSC 
vertical blank 
insert vertical border 

Outputs from the horizontal machine are: 

20 

horizontal and composite blanks 
insert data 

• insert border 
insert blank values 

25 • discard input 

• insert SAV or EAV, with YUV count 
hsync 

composite sync 
start of line 
30 • half-line increment 



Vertical Machine 

The vertical datapath has essentially the same structure as the horizontal datapath, but with 22 hardware comparators 
35 (8 for PAL, 14 for NTSC). The principal counter increments each half line, counting the half -lines through each half line, 
and counting the half-lines through each field, in turn. It is also 1 0 bits wide. 

Moreover, it is advantageous for test purposes to multiplex the half-line pulse input with another, more frequent clock, 
so that the vertical machine can be run independently of the horizontal machine. 
The estimated size is 360u x 420u. 
40 Inputs to the vertical machine are: 



clocks and reset 
• PAL not NTSC 

vertical border value 
45 • half-kline increment 



Outputs from the vertical machine are: 

select equalization, f ield or line sync 
so • vertical blank (analogue) 

• vertical sync 

F, V and V bits for SAV7EAV construction 

• insert vertical border 
insert data 

55 • insert blank value 
start of frame 
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Hardwired Comparator Design 

r^ eSM l m ^' 1 °"' ** c °""> aral ° r a «*>" is based on a siring of Senas n-we transistors either nre- 

Output Multiplex 

The output multiplex of the invention has the task of putting together the data for display. It combines data arrivino from 
' the earlier sect.ons of the display pipeline with timing information obtained from the VTG mm ° meS ° 3t3 3mV " 19 from 
The other input task of the output multiplex is to remove the two-wire interfacing. All the pipeline staqes uo to the onto... 
multipl ex have a two-wire interface, indeed the data arriving at the input of the output ^S^^^^SS 

in llT, h ^ ? ,aWn9 ^ ' 0W H0WGver - *"* is no in ^ace at the ou^uTof tHiST 

ZZtV * ? h ^fA? ab0 T e rem ° Val °' the IW °- Wire '"terfacing. the dynamics of the supply of 51 nei to S sorted 
out so ha the DRAM .menace never stalls the data arriving at the output of the horizont^upsanpfer 
Bas.ca.ly. the cutout multiplex is making a decision on a field by field basis as to whether to ou* 7a i ed of data or not 

22 Z 9 15 "K ^ ln - aCCept iS ,<W) th6n * ■« 10 outout the da * »• « the other hand the re is no va.S 
Sit f^ am P ,e ^efore the first picture has been decoded) then it will paint border color through the entire picture 

S^£«Z^2?!E- T C ° mP,i k 13603056 th6 ° U,pUt muH * ,te ™ st also ^t ^e da a is 

comrSnces 1 mUSl ^ Va " d Waiti ° 9 ,h3t be '° n9S l ° the eorrec » * ie,d b °'°'* the display 

If at some point thedata ceases to be valid, at a time that the output multiplex expects to have valid data availablp 
Border Generation 

Se U ntio 9 n 8 ^ *" 9enerati ° n °' "** to th6 " ld right ° f the picture <**» in accordance with the present 
As shown, the VTG generates the border region at the left of the picture by asserting a signal that selects bordpr ™inr 

b^er co.or * reC09niZin9 *~ " ° Ut " ° f d3t3 and »"** remaind * «* *• «S JIi 

It must be understood that there are two possible interpretations of "run out" of data Ohp « <hat «, h . 
horizontal up^rnpler is not vaiid. However, this is no. vlt is mean, here. VlZSZ oZ^oM date aftef the S 

V the 1 ,3 ! ,JnJine ^ as 9° in 9 the las, one in the line has been included in the outout stieam Se 
99 shows the equivalent action when clipping of the pictore occurs 9 

As shown toe VTG signals to the output multiplex to clip pels to ,he left of toe picture by asserting a sionaJ to tell the 

thorny m ,scard : nput pels - 0,106 this has occurred - *■ to wm ,na « 

ouS^ v H a,n,n ^ peis - At 016 end ot the active ,ine (ie - 720 p e,s iate ? tne vtg de?^sc aral ; 

output multiplex discards any remaining pels in toe data on its input. Note that in general there wHI bp S 
between the time when toe VTG indicates that cropping shouldTccur and toe stert of toe aZl E^XSKJ 

^« ^^St2 ou * ut mu,tip,ex disoafds P6fe wh6n the °- * ^SES3 

Output multiplex 

d^ te 0 stoJanr U ' tiP,eX C ° n,r0,S *• mU,tiplexin9 01 various "> urces * ^ta togetoer to form a CC.R 601 8 M multiplexed 
The timing (i.e. what is multiplexed in and when) is largely controlled by toe VTG. The output multiplex is concerned 

tbL SbP n« nr SU t!; H° f 3 * th6 St3rt °' deCOdi " 9 ' when no ™ availabtefor dhXta 

S^7hL h P T? 6r f ° r ^^S" 0 " 1 ,he entire ^9e. Eventually, the first decoded picture will anive^he out- 
put O, toe horizontal upsampler. Typically, this will no, occur conveniently at the start of the field The oZtrt multinit 

dat Z e K Per UB VT " iS th6re ^ re3dy for disp,a ^ ? " » no «- ■ wai ^ for the iSiSSt ScurS 
date that happens to turn up in the meantime has to wai, for toe star, o, the next field) 

2 ^^SZSSZS* the correct f ield of data ar ™ ng ,rom ,he SDRAM interiace is P3intad ^ 
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In addition, todealing with the data, the output multiplex also selects the correct sync and blanking signals for outputting 
to the pins. This facilitates easy connection to a wide range of composite encoders. DAC's. and the lite. Reregisters 
for the output multiplex are as shown in Table 48. The bits for the output multiplex control are illustrated in Table 49. 
There are four bytes of MPI registers associated with the output multiplex: 



10 


Register Name 


Size/Dir. 


Reset State 


Description 




border_cb 


8 


OxCO 


Cb component of border color 


15 


border_y 


8 


0x80 


Y component of border color 




border_cr 


8 


0x40 


Cr component of border color 


20 


outmux_ctrlL 


8 


zero 





Table 48. Outmux registers 



30 


Register Name 


CD 


Reset State 


Description 




hs/cs 


0 


0 


Controls whether horizontal sync or composite 


35 






sync is present on the hcsync pin. 










0 selects composite sync 










1 selects horizontal sync 


40 


hcsync_ah 


1 


0 


Controls the parity of the hcsync pin. 

0 selects active low 

1 selects active high 



so 



55 
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Register Name 



vsync_ah 



cblank ah 



blanking601 



enb!_sav_eav 



CD 



0> 
TO 

to 

CO 

0) 



Description 



0 J Controls the parity of the vsync pin. 

0 selects active low 

1 selects active high 

0 j Controls the parity of the cblank pin. 

0 selects active low 

1 selects active high 

Controls and value of ,un ™ data that is output 
during blanking. 

0 selects the value zerol 
selects the value 0x10 (sixteen) 
For CCIR 601 data this pin must be set to 1. 

Controls the generation of SAV and EAV control 
words in the output stream. 

0 suppresses SAV and EAV, in which case, 
blanking values are output at the times when 
SAV and EAV would otherwise be generated. 

1 enable SAV and EAV. Note that blanking601 
should also be set to 1 to avoid the value zero 
appearing at the output except during SAV and 
EAV. 

For CCIR 601 data this pin must be set to 1. 
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Register Name 




t State 


Description 


blank_screen 


6 


0 


When set to 1, this bit causes border color to be 
painted over the entire screen, thereby blanking 
the screen. Note that decoding continues as 
normal, but the decoded pictures are rendered 
invisible. 


vblank 


7 




This is a read-only bit (data written to this bit is 
ignored). It indicates vertical blanking. 



Table 49. Bits from Outmux_Ctrt 



a. Irrespective of the setting of this bit, chrominance data (both Cb and Cr) will be 0x80 (128 decimal) during blank- 
ing. 

Video Decoder Specifications and Features 

In addition to the aforementioned detailed description; the following disclosure is also provided regarding a preferred 
embodiment of a video decoder suitable for practice of the invention. 



• MPEG-2 MP @ ML 


♦ 2/3 and 1/1 pull down 


• Single 16 Mbit SDRAM 


• Video scaling 


• High resolution MPEG-1 


• Power including SDRAM « 2.5 W 


• a Vision compatible 


• Self configuring 


• Automatic error concealment 


• Small board area 


• Channel change support 


• QuietPad™ outputs 


• Time stamp management 


• On-chip video timing generator 



The present invention includes a highly integrated, easy to use. MPEG-2 video decoder. It fully supports all the require- 
ments of MPEG-2 Main Profile at Main Level. 

The system of present invention is also self configuring (a single pin selects between PAL and NTSC operation) and, in 
many applications; can start-up and maintain video decoding with no external software support Error concealment and 
recovery is fully automatic. More demanding applications may utilize the advanced features controlled by software run- 
ning on an external microprocessor. 

The present invention stores its own microcode in an on-chip ROM, thus avoiding the need to use an external ROM or 
download microcode before decoding can commence. See Figure 100. 

The following more detailed description of the system of the present invention is set forth for purposes of organization, 
clarity and convenience of explanation under the headings listed below: 
Signals 
Register map 
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Power supplies 

Logic levels 

Clock signals 

Reset signals 
5 Coded data interface signals 

Supply data via the microprocessor interface 

Switching between input modes 

Rate of accepting coded data 

Coded data interface timing 
io CDCLOCK 

Video output signals 

Video output control registers 

Borders, scaling and cropping 

Video output control registers 
15 Video signal timing 

MPI signals 

MPI electrical specifications 
Interrupts 
Page register 
20 SDRAM interface signals 
SDRAM configurations 

Connection of JTAG pins in non-JTAG systems 
Supported Instructions 
Characteristics 
25 Level of Conformance to IEEE 1 149.1 
Start code detector registers 
Detection of start codes 
discard.all facility 
f lag_picture_end facility 
30 start__code_search facility 
SCD example - channel change 
Parser registers 
Error codes 
Dealing with user data 
35 System organization 
Signals and registers 
Electrical specifications 
Coded data interface 
Video output interface 
40 Microprocessor interface 

Synchronous DRAM interface 
JTAG interface 
Start code detector 
Video parser 
45 Timestamp management 

Address generator configuration 
Mechanical information 



50 



«Trj S « S S i0n inC ' U r d ^ 8 ' istin 9 of a " the si 9" a,s Cpins) used, in accordance with the present invention, and a listing of all 
the registers available through the microprocessor interface. (See Tables 50 and 51 .) 
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10 



15 



20 



25 



30 



35 



40 




45 



50 



55 
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w 



15 



20 



25 



30 



35 




40 



Table 50. Signals 
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50 
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Signal Name 
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I/O 



Pin Number 



Description 



TPHOISH 



TPH1ISH 



87 



88 



w 



TSTRSTCTRL 



TLOOP 



PLLSELECT 



77 



78 
83 



Connect to GND or VDD during nor mal operation 

If PLLSELECT = 0 the on-chip phase locked loops are 
disabled. 

Set PLLSELECT = 1 for normal operation. 



15 



20 




PLLLOCK 
TDCLK 



Table 51. Test Signals 



25 



30 



Register Map 

The register map of the present invention is divided into areas. The first 32 locations are required for the normal oper- 
ation of the system. There is only five bits of address. 

The next set of 32 locations are those in the address generation circuitry that are required to setup a non-default 
SDRAM memory map. 

The remainder ol the register map are registers that are only used for test and diagnostic purposes. These can be 

paged in instead of the address generator registers. 

Table 52 illustrates the register map of the present invention. 



35 



40 



45 



Address (hex) 


Interrupt Service | 


See | 


0x00 ... 0x03 


Interrupt service 




0x04 ... 0x05 


Input circuit 




0x06 ... 0x07 


Start code detector 

■ -* 




0x08 ... 0x0a 


i 

^imestamp insertion 




0x0b ... OxOf 


(not used) 




0x10 ... 0x17 


Parser 
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10 



15 



Address (hex 

0x1 8 ... 0 x1c 
0x1 d 
0x1 e 



0x1 f 
0x20... 0x3f 



Interrupt Se rvice 

Output cont rol 

PLL control 

DRAM PAD drive 
strength 



See 



page_select* 



paged register access 



Table 3-4 



20 



Table 52. Overview of Register Map of Present invention 

In normal operation, page_select should hold the value 7 Bm ir, 

0x20 ... 0x3f wi., contain the address general user registers '° Cati0nS 

Table 53 depicts the page select register. 
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page-select 



1 



Addrgen user configuration r egisters 
Built in self test and IDCT test registers 

IM_plus test registers and SCD test registers 

Parser test registers 
Field/Frame test re gisters 
BOB test registers 
more BOB test reg isters 
Addrgen tes t registers 
DRAM IF test regis ters 

Table 53. Page Select Register 




Table 3-5 

Table 3-1 1 
Table 3-1 2 

Table 3-13 
Table 3-1 4 

Table 3-15 

Table 3 -16 

Table 3-17 

Table 3-17 

Table 3-18 

Table 3-19 



so Table 54 illustrates the interrupt service area. 
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Address 
(hex) 


Bit No. 


Register Name 


See Page 


I 0x00 I 


7 | 


chip_event | 






6 | 


end_search_event | 


1 




5 I 


unrecognized_start_event 


1 




4 | 


f!ag_picture_end_event | 


[ 




3 | 


parser_event | 


1 




2 




1 




1 | 




1 




o I 




1 


0x01 


7 J 


chip_mask | 


1 




6 | 


end_search_mask 






5 | 


unrecognized^start^mask 






I 4 


flag_picture_end_mask 


1 1 




I 3 


i parser_mask 






I 2 








| 1 








I 0 






0x02 


7 


idct_too_few_event 






6 


idct_too_many_event 






5 




- 




4 








3 








2 








1 








0 


\ watchdog_event 
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Address 


Bit No. 


(hex) 





10 



15 



20 



0x03 



7 
6 
5 
4 
3 
2 



1 



0 



Register Name 

idct_too_few_mask 
idctjoo_many_mask 



See Page 



watchdog^ma sk 
Table 54. Interrupt Service Area 



25 



Table 55 shows the input circuit registers of the present invention. 
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35 




40 



Table 55. Input Circuit Registers 

Table 56 shows the start code detector register of the present invention. 



45 



SO 
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I Address (hex) 


| Bit No, 


I Register Name 


I See Page 


0x06 


7 


scdp_access 






6 


(not used) 






5 


discard_extension 






4 


discard_user 






3 


after_search_stop 






2 


flag_picture_end 
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Address (hex) 


Bit No J 


Register Name 


See Page 




1 


after_picture_stop 




0 


after_picture_discard 




0x07 


7:3 


(not used) 




2 


discard_ali 




1:0 


start_code_search 





Table 56. Start Code Detector Registers 



In accordance with the present invention, Table 57 shows the timestamp insertion registers. 



Address (hex) 


Bit No. 


Register Name | See Page 


0x08 


7:0 


ts_high 




0x09 


70 


ts_low 




0x0a 


7 


ts_valid 




6 


ts_waiting 




5 0 


(not used) 





Table 57. Timestamp Insertion Registers 
Likewise, Table 58 illustrates the video parser registers. 



Address 
(hex) 


Bit No. 


Register Name 


See 
Page 


0x10 


7:0 


parser_ctrl0 (actually a reg file location - bits TBD) 




0x11 


7:0 


parser_ctrl1 (actually a reg file location - bits TBD) 




0x12 


7:0 


parser_error_code (actually const, field of MSM ) 




0x13 


7 


parser_access 




6:0 


reg_keyhole_addr 




0x14 


7:0 


reg_keyhole_data 




0x15 


7:0 


(not used) 




0c16 


7:0 


user_keyhole_addr 





105 



EP 0 891 094 A2 



w 



Address 
(hex) 


Bit No. 


0x17 


7:0 



Register Name 




user_keyho!e_cfata 



Table 58. Video Parser Registers 



The output control registers are shown in Table 59. 
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Table 59. Output Control Registers 



Test Registers 

The complete register map is shown in Table 60 through Table 69. 



50 
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Address (hex) 


Bit No. 


Register Name 


See Page 


P1+00 




test_mode 




P1+01...P1+03 




(not used) 




P1+04 




misr_mask 




P1+05 




(not used) 




P1+06 




misr[1] 




P1+07 




misr[0] 




P1+08 




psrg_bit_select 




P1+09 




psrg_constant 




P1+0a...P1+0c 




(not used) 




P1+0d 




psrg[2] 




P1+0e 




psrg[1] 




P1+Of 




psrg[0] 





Table 60. Built-in Self Test Registers 



Address (hex) 


Bit No. 


Register Name 


See Page | 


I P1+10 




idct_clkgen 




I P1+11 




(not used) 




I P1+12 




snpjdct[1) 




I P1+13 




snp_idct[0] 




I P1+14...P1+17 




not used 




I P1+18 




snp_tram[7] 




I P1+19 




snp_tram[6] 




I P1+1a 




snp_tram[5] 




I P1+1b 




snp_tram[4] 




I P1+1C 




snp_tram(3] 




I P1+1d 




snp_tram[2] 




| P1+1e 




snp_tram[1] 
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Table 61. IDCT Test Registers 



Table 62. IM_plus Test Registers 



P2+00 




imp_clkgen 




P2+01 




(not used) 




P2+02 




snpjquant[1] 




! DOa.ri'3 




snp_iquant[0] 




I DOj.A/1 




(not used) 




| P2+05 




snp_imode[1] 




| P2+06 




I snpjmode[1) 




| P2+07 




| snpjmode[0) 




[ P2+08 




snpjquant_ram[3) 




1 P2+09 




snp_iquant_ram[2] | 




P2+0a 




snp_iquant_ram[l] 




1 P2+0b i 


• 


I snp_iquant_ram[0] 




1 P2+0c 




iquant_keyhole_data 




P2+0d | 




iquant^keyhole^addr 




1 P2+0e.,.P2+0f | 




(not used) 




P2+10 [ 




snpjzz_ram[3] 




I P2+11 | 




snpjzz_ram[2] 




I P2+12 | 




snpjzz_ram[1] 




1 P2+13 | 




snpjzz_ram[0] 




P2+04 T 




izzJ<eyhole_data 




| P2+15 I 




izzJ<eyhoIe_addr 




P2+16...P2+17 | 




(not used) 
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Address (hex) 


Bit No. 


Register Name 


See Page 


P2+18 




scd_clkgen 




P2+19 




(not used) 




P2+1a 




snpjncrct(t) 




P2+1b 




snp_incrct[0] 




P2+1C 




snp_cdbin[1] 




P2+1d 




snp_cdbin[0] 




P2+1e...P2+1f 




(not used) 





Table 63. SCD Test Registers 



Address (hex) 


Bit no. 


Register name 


See page 


I P3+00 




parser_clkgen 




I P3+01...P3+02 




(not used) 




I P3+03 




snp_cdbout[4] 




I P3+04 




snp_cdbout[3] 




I P3+05 




snp_cdbout[2] 




I P3+06 




snp_cdbout[1] 




I P3+07 




snp_cdbout[0] 




I P3+08 




(not used) 




I P3+09 




snp-aluin[2] 




I P3+0a 




snp-aluin[1] 




I P3+0b 




snp-a!uin(0] 




I P3+0c...P3+0f 




(not used) 




P3+10 


7 


msm_access 




6:0 


(not used) 




I P3+11 


7:3 


(not used) 




2 


mssr_intr_status 




1 


mssr_ss_monitor 
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Address (hex) ] Bit noTT Register name | See paoe 





0 


mssr_ss_seiect 


I P3+12 


7 4 


1 \nui US6Q j 




| 3:0 


msm_pc 


P3+13 


7-n 




P3+14 


I 7/1 


1 (not used) 






msm_call_return 


! P3+15 


t 7-n 


I ™ 

■ * ** 


P3+16 

ft ^™ • 


1 74 

1 ' - M 


y — 

[ » (not used) 




3 0 


rnsrn mir return 


| P3+17 


70 i 




I P3-H8 




di ip_ user_ram[ ( j 


| P3+19 




snD u^pr ramffil 


I .. P3+1a | 




snp_user_ram[5] 


I P3+1b ( 




snp_user_ram[4] 


P3+1c j 




snp_user_ram[3] 


P3+1d J 




snp_user_ram[2] 


P3jMe | 




snp_user_ram[1] 


P3-Hf [ 




snp_user_ram[0] j 



Table 64. Parser Test Registers 



j Address (hex) 


| Bit No. 


Register Name 


J See Page ) 


P4+00 




ff_clkgen 




P4+01 




(not used) 




P4+02 




snp_fldjrm[1] 




P4+03 




snp_fld_frm[0] 




P4+04 




snp_padder_data[1] 




P4+05 




snp_padder_data[0] 
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Address (hex) 


Bit No. 


Register Name 


See Page 


5 | 


P4+06 




snp_padder_pf[1] 






P4+07 




snp_padder_pf[0] 




10 I 


P4+08 




snp_pf_master(3] 
(snpsel[31) 






P4+09 




snp_pf_master[2] 
(snpsel[21 




15 


P4+0a 




snp_pf_master[1] 
(snpselM 1) 






P4+0b 




snp_pf_master[0] 
(snpselfOl 




20 


P4^0c 




snp_pf_slave[3] 
(snpsel[71) 






P4+0d 




snp pf slave[2] 
(snpsel[6]) 




25 


P4+0e 

* 




snp pf slaveM] 
(snpsel[5]) 




30 


P4 + 0f 




snp_pf_slave[0] 
(snpsel[4]) 




P4 + 10 




(not used) 


■ 




P4 + 11 




snp pf pipe[2] 
(snpsel[10] 




35 


I P4+12 




snp_pf_pipe[1] 
(snDselOl 




40 


P4 + 13 




snD Df DiDefOl 
(snpsel[8) 




1 P4+14 




ff_keyhole_data 






| P4+15 




ff_keyhole_addr 




45 


| P4+16 




snp_dec_data[1] 






| P4+17 




snp_dec_data[0] 






| P4+18 




snp_ff_ram[7] 




50 


I P4+19 




snp_ff_ram[6] 






| P4+1a 




snp_ff_ram[5] 
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| Address (hex) | Bit No. 


Register Name 


| See Paqe 


P4+1b 




snp ff ram[4] 




P4+1c 




snp_ff_ram(3] 




P4+1d 




snp ff ram|2] 




P4+1e 




snp_ff_ram[1] 




P4+1f 




sn P_ff_ram[0] 





Table 65. Field/Frame Test Registers 
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Address (hex) 


Bit No. 


Reqister Name 


See Page 


P5+13 




snp vtqIO] 




P5+14 




snp_outiface[1] 




P5+15 




snp_outiface[0] 




P5+16...P5+1f 




(not used) 




P6+00...P6+07 




snp_vupram_cb1(7:0] (bobupram) 




P6+08...P6+09 




snp_vupram_cb0[7:0] 




P6+10...P6+17 




snp_vupram_cr1 [7:0] 




P6+18 ...P6+1f 




snp_vupram_cr0[7:0] 





Table 66. BOB Test Registers 



Address (hex) 


Bit No. 


Register Name 


See Page 


P7+0 




addrgen_clkgen 




P7+1 












snoopers 





Table 67. Addrgen Test Registers 



Address (hex) 


Bit no. 


Register Name 


See Page 


P8+0 




dram_clkgen 





Table 68. DRAMIF Test Registers 



Summary of Test Register Locations 



Address (hex) 


Data Bits 


Register Name 


Location | 


P2+1a...P2+1b 


10 


snpjncrct[1:0] 


The input of the chip 
(before the input circuit) 


P2+1c...P2+1c 


10 


snp_cdbin[1:0] 


Input of cdbin 


P3+03...P3+07 


33 


snp_cdbout[4:0] 


Input of cdbout 


P3+09...P3+0b 


19 


snp_a1uin[2:0] 


Input of the ALU in the 
MSM 
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Address (hex) 



Data Bits | Register Name 



Location 



P2+05...P2+07 


19 


snp_imodel(2:0] 


Input of the inverse 
modeler 


P2+02 ...P2+03 


13 


snpjquant(1:0] 


Input of the inverse 
quantizer 


P1 +1 2 P 1 +1 ^ 


I o 


snp_idct[1 .0] 


Input of the IDCT 


P4+02 P4+01 


I U 


snp_Tia_trm[1 .0] 


Input of field-frame 


P4+04 P4+05 




5np_paaaer_ aata[ i .oj 


Transform data input of 
pfadder ] 


P4+06...P4+07 


8 


oiijj pauuer_pT[i .uj 


Pred. filter data input of j 
pfadder 


P4+08 . P4*0D 


23 


31 'r„ r"^^^ 1 inasicrio.UJ 


Master input of predflt 


P4+0c...P4*0f 


23 


snp_padder_master[3:0] 


Slave input of predflt j 


P4+11...P4+13 




snp_pf_pipe[2:0] 


Half way through predflt 


P4 + 16...P4+17 


8 


snp dec dataft O] 


Output of prediction adder ! 


P5+02...P5+03 


10 


snp_vup_cb[1:0) 


Input of chroma upsample 
Cb 


P5+04...P5+05 




snp_vup_cr[1:0] 


Input of chroma upsample 
Cr j 


DC, i nr . 


12 


snp_hup_y[1:0) 


Input of horizontal 
upsampler y 


P5+08...P5+09 


10 


snp_hup_cb[1:0] 


Input of horizontal 1 
upsampler Cb 


P5+0a...P5+0b 


10 


snp_hup_cr[1:0] 


Input of horizontal 
upsampler Cr 


P5+Od...P5+0f 


10 + 

strobes 
from vtg 


snp_outmux[2:0] 


Input of outmux 


P5+11...P5+13 




snp_vtg_[2:0] 


All control inputs for VTG 


P5+14...P5+15 


13 


snp_outiface[1:0] 


Just before 8 to 16 
converter and retiming for 
the pins I 



Table 69. Snooper Registers 



Power Supplies 
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| Symbol | 


Parameter | 


Min. ] 


Max. 


Units 


VDD 


Nominal 5 V supply voltage 
relative to GND 


-0.5 


6.5 


V 


VCC 


Nominal 3.3 V Supply voltage 
relative to GND 


-0.5 


6.5 


V 




Input voltage on any pin except 
SDRAM interface pins 


(jND - U.D 


VUU + U.D 


V 




Input voltage on any SDRAM 
interface pin * 


GND - 0 5 


VCC + 0.5 






Operating temperature 


-40 


+85 


°c 


T 5 


Storage temperature 


-55 


+ 150 


°c 



Table 70. Suggested Specification Ratings b 



• D(15:0], DA[11:0], DCKE, DCLKOUT, DCLKIN, DWE, DCAS, DRAS DCS(1:0] and 
TDCLK. 

b Stresses greater than those listed here may cause permanent damage to the device 
This is a stress rating only and functional operation of the device at these, or any other 
conditions above those indicated in the operational sections of this specification, is not 
implied. Exposure to absolute maximum rating conditions for extended periods may 
affect reliability. 



Symbol 


Parameter | 


Min. I 


Max. J 


Units j 


VDD 


Nominal 5 V supply voltage 
relative to GND 


4.75 


5.25 


V 


VCC 


Nominal 3.3 V Supply voltage 
relative to GND 


3.00 


3.60 


V 


GND 


Ground 


0 


0 


V 




Operating temperature 


0 


70 




'oD 


RMS power supply current 






mA 



Table 71. DC Operating Conditions 



Logic Levels 

Three different signal interface types are implemented in accordance with the present invention. Standard (5 V) TTL lev- 
els are employed by the microprocessor interface. In addition, 5 V CMOS levels are used by the coded data interface 
and the video output interface. 3 V LVTTL levels are also employed by the SDRAM interface. 



TTL (5 V) Levels 



EP 0 891 094 A2 



10 



15 



20 



25 



30 




b 

C 



Table 72. TTL (5 V) DC Characteristics 



AC input parameters are measured at a 1 .4 V measurement level 

'o ^ 'OOC min- 

greater'** S,e3dy ^ ^ Capability of the iri »erface. Transient currents ma be much 
Whenasserted the open collector IRQoutput pulls down with an impedance of 100 O 



35 



CMOS (5 V) Levels 



40 Sle 73^^ ^S7^^ ?0% " V °° ^ V,Lm " IS 3 ™ ^ V D[> The values shown in 

laoie /a are those for V (H and V, L at their respective extreme limits of operation. 



45 



50 
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Symbol 


Parameter 


Min. 


Max. 


Units 


VlHcmos 


Input logic V voltage 


3.68 


VDD + 0.5 


V 


v 

ILcmos 


Input logic '0' voltage 


GND - 0.5 


1.43 


V 


^OHcmos 


Output logic T voltage 


Voo-0.1 




V 


Voo-04 




V 


Votcmos 


Output logic '0' voltage 




0.1 






0.4 


v d 


l|Ncmo» 


Input leakage current 




± 10 






Input capacitance 




5 


PF 


CoUTNcmos 


Output/IO capacitance 




5 


pF 



Table 73. CMOS (5 V) DC Characteristics 

• i 0 " < 1 mA 
b l OH < 4 mA 
c l 0L < 1 mA 
" U. < 4 mA ' 



LVTTL (3.3 V) Levels 



Symbol | 


Parameter 


Min. 


Max. 


Units 




Input logic T voltage 




VCC + 0.5 


V* 


v 

v lLsdr»m 


Input logic '0' voltage 


GND - 0 5 


0.8 


V 


v OLso»am 


Output logic 0" voltage 






V 




Output logic T voltage 






V 




Output current 


± 100 




MA* 


^OZsdram 


Output off state leakage current 




±20 


mA 


l|Nsdram 


Input leakage current 




± 10 


/iA 


C|Nsdr»m 


Input capacitance 




5 


PF 


^OUTsdram 


Output/IO capacitance 




5 





Table 74. LVTTL (3.3 V) DC Characteristics 



1 AC input parameters are measured at a V measurement level 
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gr h eiter the ^ dr ' Ve Capab,l,,y of the ,nterf sce Trans.ent currents ma be much 



10 



15 



20 



25 



30 



Clock Signals 

t^^SSS^"^ T '° Cta * 00de0 d,B '™> ,h » *»« *~«to>. ™» <*«* ™>y b. synchro- 

-wwemem, *tne SYlc L £T. r e 5SU5^^ P " al *° 10 **• ™*«»» '<"«° ^ 



Num. 


Characteristic 


27 MHz 


Unit 


Note 


Min. 


Max. 


1 


Clock period 


37 




ns 




2 


Clock high period 


10 




ns 




3 


Clock low period 


10 




ns 
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Table 75. Input Clock Requirements 

Note that the tolerance and stability of the clock must be adequate to comply with the 
line frequency of the appropriate video standard. V 



40 Reset Signals 

The present invention uses three reset signals: 



45 



1) RESET 

2) VTGftESET 

3) TIMERESeY 



60 



,0 WWi ° ^ 9enera,0r * the PfeSent inventi0 " -««*0 other aspects of the 

" MERESET is used b * the «"iestamp handling circuitry in accordance with the present invention. 



55 Introduction 



The coded data interface, in accordance with the present invention provides a dedicator. C£> » «» 

•° sopp'y tile coded video data to system. Alternatively, coded data ^e^^nS^o^'Soe^ 



118 



EP 0 891 094 A2 

section discusses both of these methods. . o „ Thc 

If the dedicated pins are used, coded data may be supplied either as a s.mple stream of bytes or as Tokens. The 
Tokens allow other types of information to be supplied in addition to the coded data. For example, time stamp .nforma- 
tion may be transferred using this mechanism. . 
If the microprocessor interface is used for coded data, then Tokens are always used. Furthermore, this is quite s.mple. 
Once a -Token Head" has been written to declare that subsequent data is coded data (requiring just two registers to be 
written) coded data may. thereafter, be simply written into a register. 



10 



Coded Data Interface Signals 

Table 76 defines the coded data interface signals used in the present invention. 
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Signal Name 



CD[7:0] 



CDEXTN 



CD VALID 



CDACCEPT 



BMODE 



Description 

Coded data is supplied to the present invention one byte at a 
time Data is sampled at the rising edge of CDCLOCK. 
Data is assumed to be byte-aligned 



When the coded data interface is used to transfer Tokens, 
this signal is the extension bit. This signal is sampled at the 
sa me time as CD[7:0] 

CDVAUD is sampled at the same time as CD[7:0]. When it 
is HIGH, the data is valid and is used as coded data. When 
it is LOW, the data is not valid and is ignored by the system. 



CDACCEPT indicates the readiness of the system to accept 
data. When it is HIGH, at the rising edge of CDCLOCK data 
will be latched as expected. When it is LOW. the system 
cannot accept the data (presumably because its internal 
buffers are full) and. therefore, the data should be presented 
again. 

When this signal is HIGH, data is interpreted as a simple 
stream of coded data bytes (and CDEXTN is ignored). When 
it is low data is interpreted as Tokens. This signal is sampled 
at the same time as CD[7:0] 



40 
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Signal Name 
CDCLOCK 



50 



Type I Description 

This clock is used to control the transfer of data into the 
system. CD[7:0] CDEXTN BMODE and CDVALID are 
sampled at the rising edge of CDCLOCK and external 
circuitry should sample CDACCEPT atthe same time. 

Note that in the default (reset) condition, CDCLOCK and 
SYSCLOCK must be connected to the same signal. 



Table 76. Coded Data Interface Signals 



CDVALID and CDACCEPT are used to control the transfer of data in accordance with the present invention. This type 
of protocol is referred to as a "two-wire" interface. Both signals must be high at the rising edge of CDCLOCK in order 
for a data transfer to occur. Figure 102 shows the relationship between the data (CD[7:0], CDEXTN and BMODE) and 
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CDVALID and CDACCEPT. 

SSL" da - a . iS J- 0 be SUPP ' ied Via the C0ded da,a in,erface P ins - the micropocessor interface reaisler 
enable_mpi_input- must be zero (this is its reset state). 'menace register 

Byte Mode 

i" 'k 6 P Kt Se H! i ^ enti0n ' if BM ° DE iS Samp,ed HIGH 31 the risi "9 edge of CDCLOCK (and CDVAUD and CDACCEPT 

2ZS^Z£T is frea,ed as simp,e 00d8d da,a ,n ,act - ttie da,a is imm ^ ^^■SSfSE 

Token Mode 

iE^TSS^ 31 riSin9 ^ 01 CDCLOCK ^ CDVAUD 3nd CDACCEPT are N«. then the 

th^ntnTr^ V enS il e ' y ' n accordance with the P f esent invention, to control the flow of data and control signals 
throughout the system. Theoretically, it is possible to supply any Token at the coded data input 

-.11^" accordance with the present invention, consist of a series of bytes (CD[7:01) each of which has associ 

Sf SSiT Tfn Wt (CDEXTN) - ^ ,ifSt 5yte * ,he Toke " indica,es »• type of infom^on 
The last byte of the Token is indicated by the extension bit being LOW 

For example coded data is supplied using the DATA Token. This is illustrated in Figure 103 As shown the first hvto fe 

SSSKSSSiS^ next data ,hat ,s wiM be interpreted as °* e <* a n - «** (— « * 

£££ « T ° ka T th l iS Particu,ar| y useful is the FLUSH Token This Token acts like a "reset" and it may be used after the 
end of one video stream in order to ready the system for the next video stream. The FLUSH Token isluslratSin Figure 

Supply Data via the Microprocessor Interface 

!^ H? ent ir \ vention ' token s can be supplied to the system via the microprocessor interface (MP I) by accessino the 
coded data input registers. Table 77 defines the coded data input registers V access,n9 the 
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Addr. (Hex) 


Bit No. 


Dir/Reset 


Register Name 

r 


Description 


04 


7 


RO/1 


coded_busy 


The state of this registers indicates it 
the system is able to accept Tokens 
written into coded_data[7:0]. 

The value 1 indicates that the interface 
is busy and unable to accept data. 
Roha\/inr is undefined if the user tries 
to write to coded_data when 
coded_busy = 1. 




6 


RW/0 




Controls whether coded data input to 
the system is via the coded data port 
(0) or via the MPI (1). 




5 


RW/x 


coded_extn 


The extension bit of the token data 
written into coded_data 




4:0 


(not used) 






05 


7:0 


RW/x 


coded_data 


Token data is written into this location. 



Table 77. Coded Data Input Registers 



Writing Tokens vis the MPI 

* « 

The coded data registers are grouped into two bytes within Ihe memory map to allow for eff icient data transfer. The 8 
data bits, coded_data[7:0J. are in one location and the control registers. coded_busy. enable.mpi-input and 
coded extn are in a second location. (See Table 56.) 

When configured for Token input via the MPI. the current Token is extended with the current value of coded extn each 
time a value is written into coded_data[7:0]. Software is responsible tor setting coded.extn to 0 before the last word 
of any Token is written to coded_datar7:0]. . 
For example, a DATA Token is started by writing 1 into coded_extn and then 0x04 into coded_dataf7:0]. The start of 
this new DATA Token then passes into the system for processing. 

Each time a new 8 bit value is written to coded_data[7:0). the current Token is extended. coded_extn need only be 
accessed again when terminating the current Token (for example, to introduce another Token). The last word of the cur- 
rent Token is indicated by writing 0 to coded_extn followed by writing the last word of the current Token mto 

coded data[7:0]. , 

Moreover, each time before writing to coded_data[7:0]. codedjwsy should be inspected to see if the .nterface is 

ready to accept more data. 
Switching between Input Modes 

Provided suitable precautions are observed, it is practical to dynamically change the data input mode. In general, the 
transfer of a Token via any one route should be completed before switching modes. These switching modes are shown 
in Table 78. 
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Previous Mode Next Mode 



10 



15 



20 



25 



Byte 



Token 



MP! input 



Token 



MP! input 
Byte 



MPI input 



Byte 



MPI input 



Behavior 



The on-chip circuitry will use the last byte supplied 
in byte mode as the last byte of the DATA Token 
that it was constructing (i.e., the extension bit will 
be set to 0), Before accepting the next Toke n. 

The off-circuitry supplying the Token in Token 
mode is responsible for completing the Token 
(i.e.. with the extn bit of the last byte of 
information set to 0), Before selecting byte m ode. 

Access to input via the MPI will not be granted 
(re., coded_busy will remain set to 1) until the off- 
chip circuitry supplying the Token in Token mode 
has completed the Token (i.e.. with the extension 
bit of the last byte of information set to 0) 



The control software must have completed the 
Token (i.e. withthe extension bit of the last bvte of 
information set to 0) before enable mpi input is 
set to 0. - K 



Table 78. Switching Data Input Modes 



30 



35 



40 



45 



Rate of Accepting Coded Data 

srrrzs r 0 s „r^ isssssssr ^ s*- aaa in ,he mta 

Coded Data Interface Timing 

Similarly, Table 79 shows the coded data interface timing for the present invention. 



50 



55 
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Num. 


Characteristic 


27 MHz 


Unit 


Note 


Min. 


Max. 


1 


CDCLOCK cycle 
time 


37 




ns 




2 


CDCLOCK low 

time 


17 




ns 


• 


3 


CDCLOCK high 
time 


17 




ns 




4 


CDACCEPT drive 
time 




23 


ns 


b 


5 


CDACCEPT hold 
time 


2 




ns 




6 


Input signal set-up 
time 


5 




ns 




7 


Input signal hold 
time 


0 




ns 





Table 79. Coded Data Interface Timing 



• These timings need not be observed in some circumstances. 
6 Maximum signal loading is 20 pF. 

The coded data interface uses CMOS levels. 
CDCLOCK 

The transfer of data across the coded data interface is controlled by CDCLOCK which may be synchronous to the main 
video decoder clock (SYSCLOCK). This facility may be useful in allowing the system decoder to operate on a different 
clock to the video clock. 

However, CDCLOCK is also used internally in the present invention to clock circuitry such as the start code detector. 
Since CDCLOCK does not have the benefit of a Phase Locked Loop (PLL) to ensure even mark-space ratio, external 
circuitry must be used to ensure this or the timing parameters 2 and 3 shown in Figure 105. 

In situations where CDCLOCK and SYSCLOCK do not need to be synchronous, the facility exists to drive the internal 
circuitry such as the start code detector from the PLL rather than CDCLOCK. This frees the external circuitry from the 
need to guarantee the even marked -spaced ratio. 

Figure 106 shows the internal arrangement which allows the even mark-space ratio clock generated by the PLL to be 
routed to the start code detector in place of CDCLOCK. 

If un_named_register is,:0 (reset condition), the start code detector is docked from the PLL. In this case, both 
CDCLOCK and SYSCLOCK must be connected to the same signal. The AC timing requirements for SYSCLOCK. 
If un_named_register is Vthe start code detector is clocked using CDCLOCK. In this case, CDCLOCK may be syn- 
chronous to SYSCLOCK. CDCLOCK must obey the timings as specified in Figure 105. 

Introduction 

The video output interface of the invention implements a digital output interface that complies to CCIR Recommenda- 
tions 601 and 656. All of the synchronization and blanking information is included, in the form of special code words 
(SAV and EAV), in the same byte-wide stream of data as the video information. 
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In addition separate sync and banking pins are provided so that the system may be connected directly to a wide ranae 
of deuces (such as vdeo DACs or NTSC encoders). The timing of these signals is suitable for the genLtion of a S 
signal that complies with CCIR Recommendation 624 yeneranon 0 ? a wjeo 

.^whth^t! 3 ,^ ^ time - r " ul «P lexed on a si "9'e byte-wide bus. Alternatively, a sixteen bit output mode is provided 
^oTsJ^Z^etl ' S "** ° n ° nS ^ Wide ^ Wh ' le cotor d ^ e - Sgnafs are time ^ 

Video Output Signals 

Table 80 provides the signals for the video output interface, in accordance with the present invention. 



Name 

Y[7.0) 
C(7:0] 

HCSYNC 



VSYNC 



CBLANK 



YE 



O 



Luminance output data 
Cr/Cb output data 

Horizontal or composite sync. The microprocessor reaister 
hs_not_cs controls which sync is present on this pin 

The register hcsync.ah controls the polarity of this signal. 
Vertical sync. 

The register vsync_ah controls the polarity of this signal. 
Composite blanking. 

The register cblank,ah controls the polarity of this signal. 



When sampled high at the rising edge of SYSCLOCK the Y 
(and in 16 bit mode the Cr or Cb) data is valid. 
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CB/CR 


O 


In 16 bit mode this signal indicates which color component 
(Cr or Cb) is present on the C[7:0] pins when YE is sampled 
high. 

In 8 bit mode the signal indicates which color component (Cr 
or Cb) is present on the Y[7:0] pins when YE is sampled low. 


V16/8 


I 


Used to select the 16 or 8 bit output modes. 16 bit mode is 
selected when V16/8 is HIGH. 8 bit mode is selected when it 
is LOW 


NTSC/PAL 


I 


Selects which of two standard rasters are to be produced. 
When NTSC/PAL is HIGH, a 525-line raster is produced. 
When it is low, a 625 line raster is produced. 

Note that this pin also affects other aspects of the operation of 
the present invention. 


VTGRESET 


I 


This signal may be asserted to reset the on-chip Video Timing 
Generator. This may be used to lock the video timing to 
some external constraint. 



Table 80: Video Output Interface Signals 



Figure 107 shows the output timing in 16 bit mode. Figure 108 shows the output timing in 8 bit mode. 
Video Output Control Registers 

Video output control registers, in accordance with the present invention as shown in Table 81 . 



Addr (Hex) 


Bit no. 


dir/reset 


Register name 


Description 


18 


7:0 


RW/ 
OxCO 


border_cb 


Cb component of border color 


19 


7:0 


RW/ 
0x80 


border_y 


Y component of border color 


1A 


7:0 


RW/ 
0x40 


border_cr 


Cr component of border color 



125 



EP 0 891 094 A2 



"O 
< 



1B 



IB 



o 
c 

CD 



Register name 



RO/x 



RW/0 



vblank 



blank screen 



RW/0 



enc!_sav_eav 



RW/0 



b!anking601 



RW/0 



cblank ah 



Description 



This is a read-only bit (data written to this bit 
is ignored). It indicates ve rtical blanking. 

When set to 1 . this bit causes border color to 
be painted over the entire screen thereby 
blanking the screen Note that decoding 
continues as normal, but the decoded 
pictures are rendered invisible 



Controls the generation of SAV and EAV 
control words in the output stream. 

0 suppresses SAV and EAV, in which case 
blanking values are output at the times when 
SAV and EAV would otherwise be generated. 

1 enables SAV and EAV. Note that 
blanking601 should also be set to 1 to avoid 
the value zero appearing at the output except 
during SAV and EAV ' K 

For CCIR 601 data, this pin must be set to 1 

Controls the value of luminance* data that is 
output during blanking. 

0 selects the value zero 

1 selects the value 0x10 (sixteen) 
For CCIR 601 data, this pin must be set to 1. 



Controls the polarity of the CBLANK pin. 
0 selects active low 



1 selects active high 
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Addr(Hcx) 


Bit no. 


dir/reset 


Register name 


Description 




2 


RW/O 


vsync_ah 


Controls the polarity of the VSYNC pin. 

0 selects active low 

1 selects active high 




1 


RW/O 


hcsync_ah 


Controls the polarity of the HCSYNC pin. 
.0 selects active low 
1 selects active high 




0 


RW/O 


hs_not_cs 


Controls whether horizontal sync or 
composite sync is present on the HCSYNC 
pin. 

0 selects composite sync 

1 selects horizontal sync 


1C 








(VUP sample mode) 



Table 81: Video Output Control Registers 



Irrespective of the setting of this bit chrominance data (both Cb and Cr) will be 0x80 
(128 decimal) during blanking. 



Borders, Scaling and Cropping 

The present invention attempts to always produce a picture for display that is 720 pels by either 480 lines (525 line 
raster) or 576 lines (625 line raster). The invention automatically scales the decoded picture in order to attempt to fill 
this area. 

Since only a limited number of scale factors are supported, it will not always be possible to fill this area precisely. If the 
resulting picture is too small, then a border will be painted around the decoded picture. This border will be such that the 
decoded picture is in the center of the screen. 

Conversely, if the scaling produces a picture that is too big, then the picture is cropped to enable it to be displayed prop- 
erly. The displayed region is the center of the decoded picture. This cropping is limited so that not more than approxi- 
mately 10% of the decoded picture is cropped. If more than this would be lost then a smaller scaling factor is used. 
The border color may be selected by writing to the registers border_cb, border_y and border_cr. After the device is 
reset and before any pictures have been decoded, the entire screen will be filled with the border color. In addition, it is 
possible to paint border color over the entire screen by writing to blank_screen. This may be used to hide the video 
during, for instance, a channel change. 

Video Output Characteristics 

Characteristics 

Figure 109 illustrates, in accordance with the present invention, the timing of the video output interface. Similarly, Table 
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82 illustrates the video output interface timing. 



w 



15 



20 



a 

b 



Num. 



27 MHz 




Table 82: Video output interface timing 
Maximum signal loading is 50 pF 



25 



30 



35 



Table 83 defines video output mode signals. Figure 110 shows the video output mode signals. 



Characteristic 
Setup before first clock after reset 



27 MHz 


Unit 


Note 


Min. | 


' Max. 


| 5 




ns j 


a I 



Table 83: Video Output Mode Signal: 
Operation is undefined if NTSC/PAL or V16/8 change state after reset 



40 



45 



50 



55 



Video Signal Timing 

^en^JL^ 9 ° f Pr6Sent ,nVentl ° n IS SUCh M me reSU,tln9 vldeo 0utput com P ,ies witn tne fo,, ^ n g CCIR rec- 

CCIR Recommendation 601 
CCIR Recommendation 656 
CCIR Recommendation 624 

Horizontal Timing 

ISiSSS,*"*^ ' S Sh0Wn in ! i9Ure 1 1 1 • THS nUmbefS are in SYSCL OCK cycles for the 525 line system (625 line 
system (625 line system in parentheses). v 

During equalization, the HgYNg signal is LOW for 62 cycles (66 cycles in the 625 Hne system) 

During field synchronization, the H57RC signal is LOW for 732 cydes (738 cydes in the 625 line system). 

Vertical Timing 

t?m TEUE 0 iS "f ^ 5 R9Ure 1 1 2 for * he 525 ,ine < NTSC > s y stem and R 9"^ 1 1 3 tor the 625 line (PAL) sys- 
tem. In these drawings the numbers down the left hand side provide the line number as per CCIR Rec. 656. The two 



128 



EP 0 891 094 A2 

columns at the right provide the T" and M V bits to be found in the SAV and EAV codes (see CCIR Rec. 601). 
The smaller numbers in the center of the thick, solid, black lines provide the logical line numbers of the decoded MPEG 
picture. These are. therefore, numbered 0 to 479 for the 480 line used in the 525 line (NTSC) system and 90 to 575 for 
the 576 lines used in the 625 line (PAL) system. 

Figure 1 14 shows the timing of the sync and blanking pins for the 525 line system and Figure 115 for the 625 line sys- 
tem. Note that only one of R5YRC or C5YRC may be output (see hs_not_cs) and that the polarity of each of these 
signals may be inverted (see cblank_ah, etc.). 



VTG Reset State 

W in the invention, the VTG resets to the start of line 4 for the 525 line (NTSC) system and to the start of line 1 for the 625 
line (PAL) system. 

Introduction 

15 A standard byte wide microprocessor interface (MPI) is used in accordance with the present invention. The MPI oper- 
ates synchronously to the various decode chip clocks. 

MPI Signals 

20 

Table 84 depicts the MPI interface signals. 



S Signal Name 


Type 


Description 


25 


ME[1:0] 


Input 


Two active low chip enables. Both must be 
low to enable accesses via the MPI. 


30 


MRW 


Input 


HIGH indicates a read from a register on the 
system LOW indicates a write to a register 
on the system. 

This signal should be stable while the chip is 
enabled. 


35 


MA[5:0] 


Input 


Address specifies one of the locations in the 
chip's register map. 


40 






This signal should be stable while the chip is 
enabled. 

• 


MD[7:0] 


Output 


8 bit wide data I/O port. These pins are high 
impedance if either enable signal is HIGH. 


45 


IRQ 


Output 


An active low, open collector, interrupt 
request signal. 



Table 84. MPI Interface Signals 



50 

MPI Electrical Specifications 
DC Characteristics 

55 

See 2.2.1 , "TTL (5 V) levels/ 

Figures 123 and 124 illustrate the read and write timing of the MPI, respectively. 
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AC Characteristics 

Table 85 shows the Read Timing for the MPI. 



w 



15 



20 



25 



30 



35 



40 




14 



15- 
16 
17 
18 



19 



20 



Enable low period 
Enable high period 




Address or rw se t-up to chip enable 
Address or rw h old from chip disable 
Output turn-on time 
Read data access time 



Read data hold time 



Read data turn-ff time 



0 
0 

20 



I 



70 



20 



ns 
ns 
ns 
ns 



ns 



Table 85. Microprocessor Interface Read Timing 

The choice, in this example, of M^0] to start the cycle and MB 11 to end it is arbitrary 
These signals are of equal status. y ' 
The access time is specified for a maximum load of 50 pF on each of MDr701 Laroer 
loads may increase the access time y 



Likewise Table 86 shows the write timing for the MPI. 



Num. 


Characteristic 


Min. 


Max. 


Unit 


Notes 


21 - 


Write data set-up time 


15 




ns 


• 


I 22 


Write data hole time 


0 




ns 



Table 86. Microprocessor Interface Write Timing 

The choice in this example, of enable(0] to start the cycle and enable [1] to end it is 
arbitrary. These signals are of equal status. 



45 



Interrupts 

so 



55 

Condition Event Register 

The condition event register is a one bit read/write register whose value is set to one by a condition occuning within the 
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circuit. The register is set to one even if the condition only existed transiently. The register is then guaranteed to remain 
set to one until the user's software resets it or the entire chip is reset. 

The register is set to zero by writing the value one 
Writing zero to the register leaves the register unaltered. 
• The register must be set to zero by user software before another occurrence of this condition can be observed. 
The register will be reset to zero up on reset. 

Condition Mask Register 

The condition mask register is a one bit read/write register which enables the generation of an interrupt request if the 
corresponding condition event register(s) is (are) set. If the condition event is already set when 1 is written to the con- 
dition mask register, an interrupt request will be issued immediately. 

The value 1 enables interrupts. 

The register clears to zero upon reset. 

Unless stated otherwise, a block will stop operation after generating an interrupt request and will re-start soon after 
either the condition event or the condition mask register are cleared. 

Event and Mask Bits 

In the present invention, event bits and mask bits are always grouped into corresponding bit positions in consecutive 
bytes in the register map (see Table 55). This allows interrupt service software to use the value read from the mask reg- 
isters as a mask for the value in the event registers to identify which event generated the interrupt. 

The Chip Event and Mask 

The present invention has a single "global" event bit that summarizes the event activity on the chip The chip event reg- 
ister presents the OR of all the on-chip events that have 1 in their mask bit. 

A 1 in the chip mask bit allows the chip to generate interrupts. A 0 in the chip mask bit presents any on-chip events gen- 
erating interrupt requests. 

Writing 1 or 0 to the chip event has no effect. It will only clear when all the events (enabled by a 1 in their mask bit) have 
been cleared. 

The IRQ Signal 

The IRQ signal in the invention is asserted if both the chip event bit and the chip event mask are set. The TRQ signal is 
an active low, "open collector" output which requires an off-chip pull-up resistor. When active the IRQ output is pulled 
down by an impedance of 100 il or less. A pull-up resistor of approximately 4 kn should be suitable for most applica- 
tions. 

Page Register 

In order to reduce the number of register address signals required by the present invention, a page register is employed 
to enable more than 64 registers to be addressed. This page register is at location 0x1f. Register locations 0x00 to 0x1f 
are not affected by the contents of the page register and are always present in the register map. Registers in locations 
0x20 to 0x3f depend on the page register. 

There are no paged registers that are required for normal device operation. The paged registers are, finally, only used 
for test purposes. 

In the invention, the page register is reset to the value zero. The user should ensure that no other value is written to this 
register. 

Introduction 

SDRAM Interlace Signals 

Table 87 illustrates the SDRAM Interface Signals 
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Sional Namp 


I i ype 


Description i 


DDM5 01 


I/O 


Ua\a pinS 


DAflOO] 




Muuress pins 


BS 


o 


DanK select, utten this is labeled as AM 1 on 16 Mbit 
SDRAM parts 


DCKE 


1 


Clock enable 


DCLKOUT 


0 


SDRAM clock output. 


DCLKIN 


1 


Connect to DCLKOUT 


DWE 


0 


Write enable 


DCAS 


0 


Column address 


DRAS 


0 


Row address 


DCS[1:0] 


o 


Chip select. DCS(0] selects the first "bank" of SDRAM If a 
second "bank" is used (see SDRAM configurations 1 and 2) 
then DCS[1] is also used 



Table 87. SDRAM Interface Signals 



SDRAM Configurations 

Table 88 illustrates SDRAM configurations. 



Configuration 


SDRAM Packages 


Total DRAM 


I Organization 


0 


1 


16 Mbit 


16 Mbit, 1 M by 16 bits 


1 


2 


20 Mbit 


16 Mbit, 1 M by 16 bits 








4 Mbit, 256 k by 16 bits 


2 


2 


32 Mbit 


16 Mbit, 1 M by 16 bits 








16 Mbit, 1 M by 16 bits 


3 


2 


32 Mbit 


16 Mbit. 2 M by 8 bits 



Configuration 


__SDRAM Packages 


Total DRAM 


I Organization 








16 Mbit. 2 M by 8 bits 



Table 88. SDRAM Configurations 



132 



EP 0 891 094 A2 



10 



15 



20 



Configuration Zero 

See Figure 116 for the Configuration Zero SDRAM Connection. 

Figure 117 illustrates the configuration for one SDRAM connection. Similarly, Figures 118 and 119 depict a configura- 
tion of two and three SDRAM connections, respectively. 

Introduction 

The system, in accordance with the present invention, fully supports the Joint Test Action Group (JTAG) "Standard Test 
Access Port and Boundary Scan Architecture", now adopted by the IEEE as standard 1 149.1 . 
All JTAG operations are performed via the Test Access Port (TAP), which consists of five pins. The TREST (Test Reset) 
pin resets the JTAG circuitry to ensure that the device doesn't power-up in test mode. The TCK (Test Clock) pin is used 
to clock serial test patterns into the TDI (Test Data Input) pin, and out of the TDO (Test Data Output) pin. Furthermore, 
the operational mode of the JTAG circuitry is set by clocking the appropriate sequence ol bits into the TMS (Test Mode 
Select) pin. 

The JTAG standard is extensible to provide for additional features at the discretion of the chip manufacturer. In accord- 
ance with the present invention, there are 9 user instructions, including three JTAG mandatory instructions. The extra 
instructions allow a degree of internal device testing to be performed, and provide additional external test flexibility. For 
example, all device outputs may be made to float by a simple JTAG sequence. See Table 89. 

Connection of JTAG Pins in Non-JTAG Systems 



25 



Signal 


Direction 


Description 


TRST 


Input 


This pin has an internal pull-up, but must be 
taken low at power-Up even if the JTAG features 
are not beinq used This may be achieved by 
connectinq TRST in common with the chip reset 
pin RESET. 


TDI 


Input 


These pins have internal pull-ups, and may be 
left disconnected if the JTAG circuitry is not 
being used 


TMS 


TCK 


Input 


This pin does not have a pull-up. and should be 
tied to ground if the JTAG circuitry is not used. 


TDO 


Output 


High impedance except during JTAG scan 
operations. If JTAG is not being used, t his pin 
may be left disconnected. 



30 



35 



AO 



45 



Table 89 How to Connect JTAG Inputs 



50 



Supported Instructions 

This section describes the instructions supported in this implementation of the present invention. See Tables 90, 91 and 
92. 



55 
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I nstruction 
EXTEST 



SAMPLE/ 
PRELOAD 



BYPASS 



This is the most basic instruction. It applies data from the boundary 
scan chain to the PCB. and captures the response. It has a pre-defined 
instruction code, which is all-0's in the instruction register 



fT^ £ str " ct,on . a,lows the boundary-scan cha.n to be parallel-loaded 
from the dev.ce s pins, and shifted, without the boundary-scan chain 
being switched in, i.e. transparently to system operation Bv this 
means, a 'snapshot" of the state of the device's pins may be taken 
(external clock control required to avoid mestastability) or the 

S 0 ""^ 5 ? cnain m ay t>e pre-loaded before switching over into 
cA i fcoi mode. 

The instruction code for SAMPLE/PRELOAD may be chosen by the 
manufacturer. ' 

This instruction selects the 1 -bit bypass register, to by-pass the 
boundary scan chain and thus reduce the length of bit-stream required 
to access other devices on the PCB The .nstruct.on code is pre- 
defined as all-1's H 



Table 90. Mandatory Instructions 




I nstruction 
INTEST 



This does the reverse of EXTEST' . i.e. applies data from the 
boundary-scan chain to the chip core, and captures the response 
The instruction code may be chosen by us It is up to the user to 
devise suitable tests to make use of this capa bility 

Table 91. Optional Instructions That Are Supported 



The following optional JTAG instructions are not supported: 



1) IDCODE 

2) RUNBIST 
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Instruction 


Description 


- 

5 
10 


FLOATBS 


This instruction pre-sets the Boundary-scan register to contain T in 
all open-dram cells and '0* in all others The system operation is not 
affected Since a '0' in an output cell causes the output to float, this is 
a quick way of disabling all outputs (a common requirement for PCS 
lAP+ir-tsiN Tho m limits will not float until an instruction is loaded which 
switches in the Boundary-scan chain, e.g. EXTEST. (If FLOATBS 
were to switch in the boundary-scan chain itself, unknown data would 
be driven out of the pins until the UPDATE_DR state.) 


15 


INEXTEST 


Does the combination of INTEST and EXTEST. Perhaps not very 
u«;pfiji as we have individual versions anyway. It may allow some 
users to devise a faster combined PCB/chip test. Many JTAG 
devices use this combined mode rather than separate versions. 


20 


SETBYP 


Selects the Bypass register between TDI & TDO. but switches the 
3oundary-scan chain in. This allows the PCB test to set up a 
. nncfa «« nattf»rn nn one device's Dins but still access other device's 
pins without having to reload the first device. The name is consistent 
with the same function in Texas Instrument's "Scope" JTAG devices. 


25 


SHIFTBN 


Like SAMPLE/PRELOAD, but without the SAMPLE operation. 
Allows the current Boundary-scan contents to be shifted some more, 
without being overwritten. T.I. have this instruction in their Scope 
devices, but variously call it READBN or RBRNNT neither of which is 
very intuitive. 


30 
35 


SHIFTBT 


Like SHIFTBN. except that the Boundary-scan chain is switched in. 
Potentially more useful than SHIFTBN. in that it could be used for 
optimizing PCB test patterns for small bits of logic externally 
connected between JTAG devices. E.g. for a 2-input gate near the 
far-end of the chain, several test patterns could be queued-up in the 
Boundary-scan chain, and applied in turn. EXTEST, in contrast, 
overwrites the Boundary-scan contents on each scan cycle. 



Table 92. Additional Public Instructions 



AO 

Allocation of Instruction Codes 

There are 14 defined instructions altogether. Hence there is a 4-bit long instruction register, with 2 unassigned instruc 
45 tions. Unassigned instructions are aliases of the BYPASS instruction, in accordance with IEEE1 149.1 . 
The full list of instructions and their codes is shown in Table 93. 



50 
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10 



15 



20 



25 



30 



35 



0011 



Code Instruction 



Register 
shifted 



0000 | EXTEST 

0001 I SAMPLE/ 
PRELOA D 

0010 I INTEST 



FLOATBS 
SHIFTBT 
SHIFTBN 
INEXTEST 

unassigned 
PRIVATE 



1001 PRIVATE 



SPDATAT 
SPDATAN 
SETBYP 

unassigned 



BYPASS 
BYPASS 



B/Scan 

B/Scan 

B/Scan 

B/Scan 
B/Scan 
B/Scan 
B/Scan 
Bypass 



ScanData 
ScanData 
Bypass 
Bypass 
Bypass 
Bypass 



Signals 
capture 



InputPads / 
0's 

All Pads 



0*s / 

OutputPads 
0's 

No change 
No Change 
All Pads 



B/SCAN 
register 



Class 



switched m MANDATORY 



transparent MANDATORY 



switched in RECOMMENDED 



transpare nt | PUBLIC 
switched in I PUBLIC 
transparent I PUBLIC 



switched in 
transparent 



switched in 
transparent 



transparent 
transpar ent 

Table 93. JTAG Instruction Codes 



switched in 
transparent 



PUBLIC 



RESERVED 



PRIVATE 
PRIVATE 



PUBLIC 
RESERVED 



PUBLIC 



MANDATORY 



40 



Level of Conformance to IEEE 1149.1 



Rules 



45 ALL rules are adhered to, although the following should be noted: 



Rules 

so 3.1.1(b) 
3.5.1(b) 




Description 



The TRST pin is provided 

Guaranteed for all public instructions (see IEEE 1 149 1 5 ? 
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Rules 



5.2.1© 



5 3.1(a) 
6.2 l(e.f) 

7.1.1(d) 
7.2.1(c) 
7.8.1(b) 



7.9 1( ) 
7.11. 1(.. ) 



7.12.K . ) 
8.1.1(b) 
8.2.1(c) 



8.3.1(d-i) 



10.4 1(e) 



10.6.1(c) 



Description 



Guaranteed for all public instructions For some private 
instructions, the TDO pin may be active during any of the states 
Capture-DR. Exit1-DR & Pause-DR 



Power on-reset is achieved by use of the TRST pin 



A code for the BYPASS instruction is loaded in the Test-Logic- 
Reset st ate. 

Un-allocated instruction codes are equivalent to BYPASS^ 

There is no device I D register. 

Single-step operation requires exter nal control of the system clock 
There is no RUNBIST facility 
There is no IDCODE instruction 



There is no USERCODE instruction 



There is no device identification register. 

Guaranteed for all public instructions. The apparent length of the 
path from TDI to TDO may change under certain circumstances 
while private instruction codes are loaded 

Guaranteed for all public instructions Data may be loaded at 
times other than on the rising edge of TCK while private 
instructions codes a re loaded 

During INTEST. the system clock pin must be controlled externally 



During INTEST, output pins are controlled by data shifted in via 
TDI 



Table 94. JTAG Rules 



Recommendations 




3.2.1(b) 
3.3.1(c) 



TCK is a high-impedance CMOS input 
TMS has a high impedance pull-up 



3.6.1(d) 



(Applies to use of chip) 



37.1(a) 



(Applies to use of chip) 
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Recommendations 


Descnotion 1 


6.1.1(e) 


The SAMPLE/PRELOAD instruction code is loaded durinq 
Capture-IR. * 


7.2 1(f) 


The INTEST instruction is supported. 


77.1(g) 


Zeros are loaded at system output pins during EXTEST 


77.2(h) 


All system outputs may be set high-impedance. 


78.1(f) ^ 


Zeros are loaded at system input pins during INTEST. 


B.1.1(d.e) 


Design-specific test data registers are not publicly accessible. 



Table 95. Recommendations Met 



Recommendation 


Description 


10.4 1(f) 


During EXTEST. the signal driven into the on-chip logic from 
the system clock pin is that supplied externally 



Table 96. Recommendations Not Implemented 



138 



Permissions 



EP0 891 094 A2 



Permissions 


Description 


3 2 1(c) 


Guaranteed for all public instructions 


6.1.1(f) 


The instruction register is not used to capture design-specific 
information. 


7.2.1(g) 


Several additional public instructions are provided. 


7 3.1(a) 


Several private instruction codes are allocated. 


7.3.1(c) 


(Rule?) Such instructions codes are documented. 


7.4.1(f) 


AHHitinnai rnriPQ nprfnrm identicallv to BYPASS 


10 1.1(i) 


Each output pin has its own 3-state control. 


10.3 1(h) 


A parallel latch is provided 


10.3.1(ij) 


During EXTEST. input pins are controlled by data shifted in 
via TDI 


10 6.1(d,e) 


3-state cells are not forced inactive in the Test-Logic-Reset 
state^ — 



10 



15 



20 



25 



Table 97. Permissions Met 



30 



35 



Introduction 

The start code detector (SCD). in accordance with the present invention, has the task of detecting start codes in the 
coded data stream. It converts these to Tokens for further internal processing by the system. 
In addition to this task there are a series of features that support, for example, channel change. 



Start Code Detector Registers 
40 Table 98 illustrates the registers for the start code detector of the present invention. 



45 
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CO 
CD 



Register Name 



7 



RW/0 scdpaccess 



(not used) 



Description 



This bit must be set to one before 
the values m register location 
0x07 may be written to reliably 
This causes the SCD to stop 
processing data so that there is 
never any contention between the 
microprocessor access and any 
attempt by the SCD to modify the 
registers itself. 

Once the value one has been 
written to scdp access, the 
microprocessor must poll 
scdp_access and wait until it 
reads back 1. 

Once the required accesses have 
been made to location 0x07 the 
value 0 should be written to' 
scdp_access to enable the SCD 
to continue processing data. 
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Register Name 



RW/1 



10 



15 



discard_extension 



20 



RW/1 discardjjser 



25 



30 



35 



40 



RW/O after_se archest op 



2 | RW /0 flag_picture_end 
RW/O after_picture_stop 
RW/O after_picture_discard 



45 



07 7:3 



(n ot used) 
RW/O ~1 discard_all 



Description 



When discard_extension is 1. 
any extension data that is not 
recognized as MPEG-2 MP@ML 
is discarded at the start code 
detector When it is 0. such 
extension data is passed through 
the coded data buffer to the 
parser. 

With the standard microcode, 
there is no point in setting 
discard_extension to 0. 



When discard user is 1, any 
user data is discarded at the start 
code detector. When it is 0, used 
data is passed through the coded 
data buffer to the parser. 

Whilst facilities exist to handle 
small amounts of user data at the 
parser care must be exercised if 
discardjjser is set to 0. Note 
that the system cannot deal with 
arbitrary amounts of user data. 



Used m conjunction with the 
start code_search facility. 



This is set to 1 to enable the 
flag pi cture„end facility. 

Used in conjunction with the 
flaq_pict ure_end facility. 

Used in conjunction with the 
flag_picture_end facility 



This is set to 1 to enable the 
discard j*H facility. 



so 
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X 
CD 



< 



C 

c 

CC 



Register Name 




10 



00 



RW/0 



RWVO 



RW/O 



Description 



start_code_search 



A non-zero value in this register 
enables the start code search 
facility. See 8 .5 on page 84. 

(not associated with the start code de tector! 
end_search_event 



RW/0 



unrecognized_start_event 



This bit is set whenever a 
start_code_search is satisfied 
If end_search_mask is also set 
to 1 then an interrupt will be 
generated. 6 







1 3 0 




35 


01 


7 








6 


RW/0 


40 




5 


RW/0 






4 


RW/0 


45 




3:0 





flag_picture_end_event 



This bit is set whenever an 
unrecognized start code is 
detected If 

unrecognized_start mask is 

also set to 1, then anlnterrupt will 
be generated. 



This bit is set whenever the end 
of a picture is detected and 
flag_pjcture_end= 1 if 

flag_picture]>nd_mask is also 
set to 1 then an interrupt will be 
generated See 8.4 on page 82. 
(not associated with the start code detector) " " — 



(not associated with the start co de detector) 
end_search _mask 

unrecognized_start_mask 
flag_picture_end_mask 



See end^search^even t above 

See unrecognized_start event 
above. " 

See flag_picture_end event 

above. 



Table 98: Start code detector registers 
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• event bits are not simple R/W reg.ster bits 

all .nterrupts are cond.t.onal on chip_mask be-ng set to 1 
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Detection of Start Codes 

The start code detector of the present invention will only detect start codes that are correctly byte aligned. 
The present invention deals only with video start codes. Unrecognized start codes are detected and cause an 
unrecognized_start_code event. The unrecognized start codes are the system start codes (with values 0xb9 through 
Oxff) the reserved start codes (OxbO. 0xb1 , and 0xb6) and the sequence_error_code (0xb4). 



discard_all Facility 

The discard_all facility may be used to discard all data that enters the system. It is possible to select the discard_all 
facility "manually" by setting the register discard_all to 1. However, it is necessary that scdp_access must first be set 
to 1 and then polled until it reads-back 1. Generally, it is typical to enter this mode automatically as part of the flag- 
plcture_end facility. 

The present invention will continue to discard all data until either the value 0 is written to discard_all or a FLUSH Token 
is encountered Note that FLUSH Token that the resets discard_all is deleted from the stream of tokens and does not 
affect the parser or any subsequent blocks of circuitry. 



flag_picture_end Facility 

The flag_picture_end facility, in accordance with the present invention, is intended to allow a clean termination of 
decoding by waiting until the end of a picture before stopping the flow of data into the system. The parser, therefore, will 
see no incomplete pictures. 

Figure 120 illustrates as a flow chart the flag_picture_end facility. As shown, it is possible to generate an interrupt 
(f lag _picture_end_event) when the end of the picture is detected. This may cause the SCD to stop processing data 
until the interrupt is serviced. Alternatively, the SCD may be allowed to proceed. 

If after-picture_discard is set to 1 . then after the end of the picture is detected, all subsequent data will be discarded. 
This is most useful for discarding the trailing data from one channel that is M in flight" in the system demultiplexor prior to 
a channel change. 

Note that the start_code_search facility in this embodiment takes priority over f lag _picture_end facility. In this way. 
the data that is being discarded due to the start_code_search is not examined to determine whether the end of a pic- 
ture has been reached. 



start_code_search Facility 

In the invention, the SCD can be set to search for specific types of start codes. This may be used, for instance, 
channel change to search for a sequence start code before decoding commences. 



start code search 


Start codes that end the search 


0 


(none - normal operation) 


1 


picture_start_code, group_start_code 
and sequence_start_code 


2 


group_start_code and 
sequence_start_code 


3 


sequence_start_code 



Table 99. start code search Modes 



The search mode is entered by writing a non-zero value into start_code_search. The start code detector will then 
search for the appropriate start codes as indicated by Table 99. All data and Tokens are discarded while the search con- 
tinues. When one of the appropriate start codes is encountered, the search ends, start_code_search is set to zero and 
an interrupt may optionally be generated. 

Note also that a FLUSH Token will terminate the search as if one of the indicated start codes had been encountered. 
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^ Rgure 121 illustrates as a flow chart the star1_code_search facility, in accordance with the present invention.. 
SCD Example - Channel Change 

SCD fedU,ie5 in ■—»•»«- — Q — « « aotas *c e»ec . 

1 in to f tag_picture_end 
15 • 1 in to after _j>icture_discard 

• 1 in to f lag_picture_end_mask 

LTlt* "ST ^ deteCt ° r det6CtS thS 6nd ° f the plcture ' » Mediately starts to discard all subse- 

• 3 (search for sequence_start) into start_code_search. 

• 1 to f lag _picture_end_event (to clear the event) 

?5 3) Then the microprocessor retunes the tuner to select the new channel 

deSST set ' oence start code is delected, the start code detector ceases discarding data and reeomea norma. 



35 



40 



45 



50 



Introduction 



^^^JSKSESr 10 inKr,cl wah 0,9 ^ - si ™ ~* m * 

I^Tk !? 18 * We 10 — «*"ng microprocessor when it delects untmial o, une^clod evotts 

d^meZ-tn^n 5 °' WB ^™^ mM - d -»« mi "^^^ ra: odepn«eeeo,.Tneee.,. 



Parser Registers 

The registers used by the parser as shown in Table 100 
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Address (Hex) 


Bit no. 


• 

Dir/resel 


KcClblci .>»dmc 


Description 


i, ^ * 

10 


7 1 


RW 


(parser_ctrl) 


No function allocated 




0 


RW 


parser_continue 


Used in certain situations to indicate to the 
parser whether it should continue with its 
/"itrrAnr artivitv nr rpturn to normal decoding 


11 


7:0 


RW 


parser_status 


Used to indicate the status of the parser in 
certain conditions 


12 


7 0 


RO 


parser_error_code 


This location contains an error code when the 
parser has interrupted and is waiting to be 
serviced This indicates the reason for the 
interrupt ___ 



25 



30 



35 
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15 



20 



25 



30 



35 



40 



45 



X 

X 

V) 

v_ 



o 
c 

55 



0) 

w 

CD 



Register Name 



l 




6 0 R\V 



Description 



parseraccess 



reg_keyhole_nddr 



M 7 0 RW 



reg_keyhole_datn 



15 70 



16 



7 0 RW 



(not used) 
user_keyhole_addr 



17 



7:0 RW 



user_keyhole_data 



00 



The value I must be written to this register to 
enable access to the other parser reeislers The 
controlling microprocessor must then poll this 
bit until it reads back the value I indicating that 
the parser has stopped processing data and can 
be accessed 

Note that as a special case, if the parser is 
stopped waiting for it interrupt to be serviced 
p;irser_error_code may be read without first 
writing I to parser_access 

This register is used to address the location in 
the parsers internal register file that may be 
written to or read from via reg_keyhole_data 
Note that each access (read or write) to" 
regkeyholedata increments 
regkeyhole nddr bv one 

A read from this location actually reads data 
from the parser's register file at the location 
indicated by reg_keyhole_addr Similarly a 
write to this location actually writes to the 
parser's register file at the location indicated bv 
reg_keyhole_addr 



This register is used to address the location in 
the user data RAM that may be written to or 
read from via user_keyhole_data Note that 
each access (read or w rite) to 
user_keyhole_dntn increments 
user_keyhole_nd dr by one 

A read from this location actually reads data 
from the user data RAM at the location 
indicated by reg_keyhoIe_addr. Similarly a 
write to this location actually writes to the user 
data RAM at the location indicated by 
reg_keyhoIe_addr 



so 



(not associated with the parser) 
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Address (Hex) 


Bit no. 


Dir/reset 


Register Name 


Description 




3 


RWVO 


parser_event 


This bit is set whenever the parser detects an 
error condition If parser_mask is also set to l ! 
then an interrupt will be generated. b 




2:0 




(not associated with the parser) 


01 


7:4 




(not associated with the parser) 




6 


RW/O 


parser_mask 


See pnrser_event above 




30 




( not associated uith the parser) 



Table 100. Parser Registers 



J event bits are not simple R W register bits. 

b all interrupts are conditional on chip_mask being set to 1 



Error Codes 

Whenever the parser detects an event condition, it sets parser.event. If parser_mask is set to 1 (indicating that the 
user system is interested in servicing parser events) the parser stops processing and (assuming that chip_mask is set 
to 1) an interrupt is generated . 
On responding to the interrupt the controlling microprocessor should read parser_errt>r_code to determme the cause 
of the event Table 101 provides the complete list of defined error codes in this regard. 

After the controlling microprocessor has responded to the event in the appropriate manner it must allow the parser of 
the present invention to resume processing. This is done by clearing the event by writing the value 1 to parser_event. 



Code 


Name 


Description 




ERR_USER_DATA 


Indicates that user data has been encountered 
and is present in the user data RAM 



Table 101. Parser Error Codes 



Dealing with User Data 

Small amounts of user data may be read out from the parser. By default, all user data is discarded by the start code 
detector. This is to protect the system from the inappropriate use of large amounts of user data wh.ch would be beyond 

its capabilities ... . ± . 

In order to allow user data to reach the parser the register discard.user must be set 0. Whenever user data is encoun- 
tered in the bitstream the bytes of data are buttered up in an on-chip user data RAM. The RAM has space for 1 92 bytes 
of data to be buffered. When all of the bytes of user data have been read (or the RAM is full) the parser will generate 
an event (ERR_USER_DATA) which allows the controlling microprocessor to read the data from the RAM 
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9 



^et^lTnd^ ? m !"°^ 0cessor ™* ** obtain access to the parsers interna, registers by set- 
SJTk - access *° 1 and ,hen P° ,,in 9 this b.t until it reads back 1 . The number of bytes in the user data RAM k I iS 

seSSS iSSSr = msscsk 

Limiting the Amount of User Data 



User Data RAM 



be preserved for later use. 
Introduction 



System Organization 



The present invention includes a counter that is incremented at regular intervals derived from the 27 MHz SYSCLOCK 
^TIi i !T*h^K Perfor ^ te CmeS,amP mana 9 ement ««h respect to its internal time counter denoted "video- 

videotime - modifiedtimestamp = timestamp - time E q -j . 
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modrfiedtimestamp = videotime + (timestamp - time) EQ 2: 

Figure 1 22 shows one possible organization ot the arithmetic to derive the modified time stamp. In reality, it is most likely 
that the actual additions (and the shift) will be performed on a processor rather than in dedicated hardware. There are. 
of course, many other ways to derive the same numeric value of the modified time stamp. For instance rather than .hav- 
ing a copy of the videotime counter, it may be better to simply record the value of "time", when the RESET_TINIE pin of 
the invention was last asserted. From this information and the current value of "time" it is always possible to deduce the 

current contents of video time within the system. 

It will be appreciated that any suitable rearrangement of arithmetic operations that yields a suitable value of the modified 

ATsntw^n^gu'reTa? the modified timestamps used by the invention use only sixteen bits. This is achieved in two 
wavs First since the difference between time and the timestamp (used to derive the modified timestamp - see Equat.cn 
2) should always be small, the more significant bits can be discarded. Second, since the invention only controls the 
presentation of video to the nearest frame-time, the less significant bits are also not required and are discarded by shift- 

Thu^the^ixteln bits of time information maintained are able to deal with timing errors of up to about 1 1 .5 seconds with 
an accuracy of about 180 us (about 1% of a field time). 

Claims 

1 . A parallel Huffman decoder, comprising : 

an input accepting a mixed data stream comprising Huffman coded variable length codes, fixed length codes, 
and data tokens, wherein all complete Huffman codes are received in a single cycle of operation ; 
a selector operative on said data stream, wherein in a first mode of operation variable length codes are pro- 
vided to said address lines of a Huffman Code ROM. and in a second mode of operation data tokens and a 
signal representative of a length of fixed length codes is output by the decoder without reference to said ROM ; 
a pair of input registers for receiving Huffman coded data, both of said registers directing input in parallel to 

said selector ; and nu 
a Human Code ROM for receiving input from said selector and another ROM table select input; said ROM pro- 
viding a first decoded data output, said selector being responsive to a second output of said ROM. 

2. A Huffman decoder as recited in claim 1 . adapted to decode MPEG-1 data. 

3. A Huffman decoder as recited in claim 1 , adapted to decode MPEG-2 data. 

4. A Huffman decoder as recited in claim 1 . adapted to decode both MPEG-1 ad MPEG-2 data. 

5. A Huffman decoder, as recited in my one of claims 1 to 4, wherein one of said pair of input registers handles most 
significant data ad the other of said pair of registers handles least significant data. 

6. A Huffman decoder, as recited in claim 5, adapted to decode variable length codes (VLC). 

7. A Huffman decoder, as recited in claim 5. wherein said ROM forms a VLC table index calculation. 

8. A Huffman decoder, as recited in claim 7, wherein an index-to-data operation follows said index calculation, to pro- 
vide decoded data. 

9. A Huffman decoder, as recited in claim 5, adapted to decode coded coeff icients. 

10. A Huffman decoder, as recited in claim 5, adapted to decode Intra-DC values. 

1 1 . A Huffman decoder, as recited in claim 5. adapted to decode motion vector deltas. 

12 A video decoding and decompression system, including a parallel Huffman decoder as recited in claim 1 , compris- 
ing a memory for "don't care" handling, comprising: a set of memory address lines, inverted address lines and data 
lines said address lines and inverted address lines being connected according to a decoding format to access the 
addressed information in the form of a data word, "donl care" address locations on the data lines being uncon- 
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mation represent — - 

lines, such that a signal is p^STo^Jl «W 1 2h SJ2 7* °^ 9r ° UPS °' 3ddreSS Bnes and inverted address 
activatab.e by one L^^rtod«d?SiSj?5^ T addreSS ' Mid memor y havin 9 a by-pass 
line as a "don , care" in me same g^oup sub-address ■»• as a defined value and another sub-address 

o 

Z each a^^ 5 ffJ 2 " Whe ^ a P* « *** are associated 

connected to a NOR gate representor data to be accessed, each pair of said data lines being 

! 1 5 ' « £EJ l^ZT"^*" * 1 * ^ h 3 "« ™* - aeration toKens 

16 ' S^ty-S^r 1 ^ SYSlem aCC ° rdin9 l ° C ' aim 12 ' fUrth6r «^*» a ■*-* - P^ned 



an inverse modeller circuit ; 
an inverse quantizer ; and 

by saw tension indicators. ,he te^ £££ M*b *«<«->. tam, ANmtM 

17. Apparatos to, feeding a mixed daB s , ream K ^ ( . 

SS""* 3 * P0 ~ 1 " l0Caeons *«* a "° ■ - *- *» address** 

address lines is derined to co»eiSa SmT™SJ^ fSS f 01 Md * ana a sec °" d sel <" 

«ne°dt or'" ^ ** " , "° n ^ tWB 3 ,OCatton h «" « * **» by said activated word- 

means coupled to said outpu, bus for determining the .ength of the variable .ength code, and coup-ed to sa« 

ss^ssr.sas skst de,ector outpu,s a si9na ' representative ° f a ,en9th ° f the — 

18 " a^st^S * 3dd r - ««*~ a firs, ,ine driven in a true sfate 

are coupled to select "f ire ^2221^^?^ ^ W ° rd ' ine driver and second w °*Kne driver 
of saidt, ITZVr^^ZT 3ddreSS ' ineS 3CCOrda " Ce "* *• Pattern of bits 

address line comprises a first ,ine driven in a true state 
are coaled to selected f ZjZS lini^nf ^ ^ J! ' W ° rd " ne driver and «*■ sec °r>d wordline driver 

sa^ f irScode ^KcKKS ^HSSSST" in accordance '* the — " * brts - 
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20 A method of decoding a mixed data stream comprising Huffman coded variable length codes, fixed length codes, 
and data tokens, the stream including a first code having a first length, and a second code hav.ng a second length, 
comprising the steps of : 

memorizing data in a plurality of locations of a memory, wherein said memory has a plurality of address lines 
for addressing said locations, a first set of said address lines and a second set of said address lines being 
defined to correspond with a pattern of bits of said first code and a pattern of bits of said second code respec- 
tively ; 

identifying variable length codes in the data stream ; 

respectively associating each bit of a said variable length code with a said address line ; 
activating said address lines responsive to the bits associated therewith ; 
coupling a first wordline driver to said first set of address lines ; 
coupling a second wordline driver to said second set of address lines ; 

activating a selected one of said first wordline driver and said second wordline driver in response to signals on 
all of their respectively coupled address lines ; 

obtaining data from a location in said memory that is driven by said activated wordline driver ; 
from said obtained data determining the length of the variable length code ; 

according to said determined length, aligning bits of a succeeding variable length code with said address lines 

for association therewith ; # 

identifying fixed length codes in the data stream ; and outputting a decoded value thereof without accessing 

said memory. 

21 The method according to claim 20, wherein each said address line comprises a first line driven in a true state and 
a second line driven in an inverted state, and wherein said first wordline driver and said second wordline driver are 
coupled to selected first and second lines of their coupled address lines in accordance with the pattern of bits of 
said first code and the pattern of bits of said second code. 

22. The method according to claim 21 , further comprising the step of predecoding said address lines prior to perform- 
ing said step of placing a signal thereon. 

23. The method according to claim 20. further comprising the step of predecoding said address lines prior to perform- 
ing said step of placing a signal thereon. 

24. The method according to claim 20. further comprising the steps of : 

identifying tokens in the data stream ; and 
outputting the tokens. 



151 



EP 0 891 094 A2 



O 
o 

0) 



C 



L_ 



O 
CM 



CM 
CM 



A 



o 
> i— 



* — 

o 

zr2 



c 
O 



o E 
o c*. 



o 



sj 









o> 






o 


E 
o 


v_ 


to 


CD 


cx 


> 






. . . 



OJ 
O 

co 



CO 

o 

CM 



c 
o 

■ — 

i5 









E 


CD 


o 


• 

U_ 





r 



Q 

C/) 

o 



o 
c 

CO 



1 "■■ 




CO 


*— 


E 


*£? 


o 






GO 



1 



II 

X 



O O 3 
OOCD 



o 
t> to 

— k_ 
QJ 



CL LZ 



o 

CM 



SI 



Q_> 














> — 


> 






o 





jr 11 — 


. .-1 


l 


i 1 > 
■ 1 I 








H 



tfi 
O 
CX 

to : 











to 


^»~» 


\_ 


c ■ 




o 


> 




c 


O 



to 

CM 



[ 



QJ 
(_> 
O 

QJ 



CM 



J? o 

> CP 

-E Kj 



L 



o 



o 

E 



CM 



I 



CM 



if 



-So* 

GO O O 



r 

L 



•tE-- 



■ 


o 


tn 




CO 




a> 






a> 


x> 


c 




QJ 




O 



CO 



qj a> 

to — 

k_ QJ 

> o 



c 




o 


a> 


JE 


od 




o 








Q 



^ Li? 



i 



1 



o 

CM 



CO 



V^CM 



(3 



o c 

— CM 

o o 



_ J 



QJ 

to 

v_ 

o 

a. 

o 

eu 



O 

CM 



cl a. 



V 
CM 
O 
CM 




152 



EP0 891 094 A2 




153 



EP 0 891 094 A2 




FIG. 3 




LEFT BARREL 
SHIFTER 



FIG. 4 



RICHT BARREL EXTENDER 
SHIFTER 



i 



MULTIPLEXER AND 
EXTENDER 
CONTROL 



CODED ADDRESS(H) 



FIG. 5 



WIDTH 




SUBSTITUTION 




DECODE 




DECODE 






6 


eL 



INDEX 
8 





V 












JUSTIFY 




12 


1' 


J 5 



REGISTER 

FILE 
ADDRESS 



SHIFTER 
CONTROL 



154 



EP 0 891 094 A2 



218 



STATE MACHINE 



ADDRESS SUBSTITUTION 
VALUE 



EVALUATED CONDITION 



INSTRUCTION 



DATA IN 




FIG. 6 



DATA OUT 



REGISTER FILE 
CIRCUITRY 




ADDRESS SUBSTITUTION EVALUATED CONDITIONS 
VALUE 



SHOW MICROCODE CONTROL 



FIG. 8 



155 




156 



EP 0 891 094 A2 



04 




157 



EP 0 891 094 A2 



-A 
V 



V 



T IN[21:0]> 



IN CLK > 



OUT CLK > 



EN BH > 
EN GH > 



ADD > 



way 



IN CLK > 


111 Al 1/ 

IN_CLK 










c\i it n if \ 


OUT CLK 






<21:0] 




LATCH10 > 


LATCH 10 










LATCH64 > 1 


LATCH64 










LATCH32 > 


LATCH32 










LATCH76 > 


LATCH76 










LATCH > 


LATCH 


cd" 


• • 


• » 


• * 


SEL BYP> 


SELBYP 


CM 

'cd' 


CM 


CM 
CM 


CM 
LO 


I SEL P > 


SELP 


CD 


S 

CD 




c3 



MUX OUT70> 
MUX 0UT61> 



MUX OUT43> 



MUX OUT52> 



PREC 





cd 

* * 






cd 

• * 


CD* 










CM 


cm 


cm 


est 


CD 


in 




<o 


o 


CD 


<3 


CD 



PREC 

^ S 

23! 



COMMON 
BLOCK 



CM CM 
CD == 



o 

CM 



CM 



CD CD 

CM CM 

I — t— 

ZD ZD 

CD O 



CD 
CM 



CM 
CM 



CBLK 









1 


* • 


CD 


CD 


CD/ 










CM 


CM 


CM 


CM 










CD 






CM 




CO 




LO 


o 


CD 


o 


O 


cd 


CD 


CD 


CD 



X" CBLK 
232 



IKLCLK 
OULCLK 
EN.BH 

EN.GH 

ADO 

SUB 
EN_0 

MUX_0UT70 ^ 
MUX.0UT61 ^ 
MUJL0UT43 
MUX_0UT52 

POSTC 



CM CM 
CD* 

r-^ to 

O CD 

CD <_> 



CM 
i-D 



CD 
CM 



CM 

o o 

CD C-> 



POSTC 
233 



FIG. II 



T_OUT[21:0]> 



158 



EP 0 891 094 A2 



| SEL P > 
I SEL BYP 

I IN CLK > 




159 



EP 0 891 094 A2 




CM 
O 



160 




162 



EP 0 891 094 A2 




163 



t 

EP 0 891 094 A2 



CBLK 



-IN0[2 1 ],IN0[2 1 : 1 ] JN0[2 1 :2] 
(SR2 ond sign-extended) 



IN3[2 1:01V IN3|21].IN3[21:1] 



(SRI ond sign- extended) 



IN2|21:0|V IN2[21].IN2[21:1J 



(SR1 ond sign -extended) 




NIN1SR[21:0] 



i 



INHZliOD- INI [2 1 J.IN1 [2 1 : 1 ],IN1 [21 :2] i 
(SR2 ond sign-extended) 
M1C[21K)) 

M1S(20:0) 



IN2[21:0] 




MUL C15 




M3C[20:0] 
M3S[10:OJ 



MULC353 




MULC352 




GNO 
M1S[20:0] 
IN2[21] 
M2C[20:0] 

GND.GND 
M2S[10:0] 

M2C[2&0] 

IN2[21] 

M2S[10:0] M ^C[20:0) 
GND.GND 
M3S[10:0] 

M4C[21:0] 




T3S[21:0] 



T3C[21:0] 



M4S[21:0] 




T2S[21:0] 



i 



T2C[21:1] 



MULMC15 



FIG. 14a 



164 



EP 0 891 094 A2 



231 



CBLK 



T3C[21:1] 
GNO 




T2C[21:1] 
GNO 



0OS[21:0] 



00C(21:1] 
00C[21:lj 

GNO 




RESO 




01S[21:0] 



01C[21:1] 
01C[21:1] 

GND 



0UT0[21:0l > 




RES1 



h 



T3C[21:1] 
GNO 




O3S(21:0] 




03C[21:1] 
03C[21:1] 

voo 




RES3 



O2S[21:0] 



T2C[21:1] 
GNO 



02C[21:1] 
02C[21:1] 

VOO 




RES2 



FIG. 14b 



165 



EP 0 891 094 A2 




166 



EP 0 891 094 A2 





167 



EP0 891 



094 A2 




168 



EP 0 891 094 A2 




MUX OUT52> 
MUX 0UT43> - 
MUX 0UT6l) >- 
[MUX 0UT7py 
S 



R52[2I:0) 



R43[21:0) 




169. 



EP 0 891 094 A2 



TDOUT 



NTDIN 
TSH 0— E 
MTADIN — £" 
TSH A 

0JCK1 
OJCKO 
NTRST — C 





FM.SCAMP TO SCANP 
INVERT SCAN SCANTEST 

scanS'pd mK ~ M 










f 




JTCKO CONTROL 
NTLRESET tJ 


1 




> 





TOIN 

SCANTEST 



NTDOUT 



NTADOUT 



JTAGCTL 



UPA0DR[3:0] 
J LU 0DATA[7:0] 



[ 



, luPRSTR! 

Dm 



RAMTEST 



UPAD0R(3:0] 

LUPDATA[7:0] 
UPRSTR 

UPWSTR 
RAMTEST 



IN 0ATA(11:0] 
IN EXTN 



Tl 



<7 



> 



IN ACCEPT 



IN VALID 



T2 



TDM 



CO 



CL. O. 



2 O 

uo to. 



Q_ CL 



IN_EXTN1 
IN_01[1 1:0] 
IN_VAUD1 
IN ACCEPT 1 

INLEXTN2 

IN_D2[21:0] 

INLVALID2 



LO 
UJ 

or 
I 



^ ^ 3e 



1DIDCT 



CD 



I — ^ « — 



TDIN 



11^ ^ czl^P 

uj uj uj CO rr^ 

on o , 

Z =K O LO tO ^5 



CO 



en 



0UL.EXTN1 
OUTJ)1[2l:0j 
0UT_VALID1 

0UL.EXTN2 
0UT_D2[8:0] 
0ULVALID2 
0UT_ACCEPT2 



CO 



Q- O O 



TDOUT 



IDCTCORE 
IDCTCORE 



if 



o 

UJ. 
CO 



■i 



CC UJ LJ 

UJ fet? CO 



Ao'^ K U , 'r. DD CO CM . 

3 S <-> z 



<_> 



FIG. 16a 



SEL TEST> 
UPAODR[3:0l- 



IDCLSEL 


ISNPSEL(1:0] 


TRANSELAODR 


UPADDR[3:0] 


TRANSELDATA 


CGSEL 




TSNPSEL 


IDCTSELS 




IDCTSELS 





ISNPSa[1:0] 

TRANSELAODR 

TRANSELDATA 

CGSEL 

TSNPSEL 



170 



EP 0 891 094 A2 



SYSCLK^r 
NOTRESET^ 



SCANTEST- 

PTEST--EZZ> 

OJCKO- 
0JCK1 - 



OMODE^E 



CGSEL 
LUP0ATA[2:0] 

UPWSTR 
UPRSTR 



SYSCLK 
NOTRESET 

SCANTEST 
PTEST 

OJCKO 
0JCK1 

OMODE 

LUPDATA[2:0l 
UPWSTR 
UPRSTR 



IDCTCG 




IDCTCG 
IOCTCG 



PHO 
PHI 
NOTRSTO 

TCKM 
TCKS 
NTCKS 
TPHO 



[eg 



PHO 
PH1 

NOTRSTO 

TCKM 
TCKS 
NTCKS 
TPHO 



CO 

or 



0 0- 



01 OE 

WW 



01 00(21:0] 



x 



3C 



IN_EXTN 
1N_0ATA 
IN_ VALID 



21:0) 



TRAM 



0UT_EXTN 
0ULDATA[21:0] 
OULVAUD 



TOIN 



CD V? 

Q£ ^ OC CJ ^ CO J 

===i Z3 cn to o2 oo 



CD 



or m 
cj to z 

Q, UJ 



I 



o z <_> 



T0OUT 



IDTRAM 
IDTRAM 



UPADDR[2:0 
LUP0ATA[7:O 
UPRSTR 
UPWSTR 
TRAMSELA00R 
TRAMSELDATA 
RAMTEST 
TSNPSEL 
NTEST 

02 IE 



T00UT 



eg co cm 



UOI 

. I o 



D2 IV 



) M)2 10(21:0] 



OUT EXTN 

(OUT DATA[8:0] 



D 



FIG. 16b 



OUT VALID 

OUT ACCEPT 



171 



EP 0 891 094 A2 



NON RESET 



FIF1 



FIFO 



I IN EXTN1> 



IN1D1[11:Q] > 



IN.VAUD1> 



< IN ACCEPT1 h 



8 



I . I 



ertn 
doto 

valid 



occept 



00 



8 



8 IN DREG1[11:0] 



8 



<DCHK ACCEPT 



I0IPIF12B 
T1IP2W 



DCHK EXTN> 



DCHK VALID> 




-i 



FIG. 17a 



172 



EP 0 891 094 A2 



IN EXTN2 > 
T 2 | IN Dl[l1.0]> - 

nrvAiiD2> 



T 



extn 
doto 
valid 



enable 



00 



I0IRF22KB 
T2IP2W 



_Ji « 

A IN DREG2[21:0] 


> 

8 


f 

< 


IN 0R£G(7:2] 





FIG. 17b 



173 



EPO 891 094 A2 




T2_IN 
[21:0] 



T1_IN[11:0] 



TRANSFORM 



T1.0UT[21:0] 



CD 



CO 



CD ..... 

?2 mvntn 



LU CO CD u_t CD CD 



CM 



CO 



i . i GD CD 



r». Lki uj r-. lu lu 



co _Z3 __3 i — ) — 



to 



J I 



I t 



CM 



to to 

I I 

csi 
I I 



LOtOtOtOtOtOtOtO 



fO rsi CD h-j cm 



CD 



cot to 

<39ooo'ooooo 1 
» » i i i i i i i i 

CM CM -r- «— CSI CM CM CM Q 




T2.0UT 
143:0] 



IIDCT ENB > 



f IN DREG 1 [2:0] 


355 tn 

UUVI 

OCHK EXTN a='fc'« 
DIN[7:0] ° S 
DCHK VALID = 


85 

S EXTN OUT 
DOUT[7:0] 
VAUO OUT 


ISR1 FXTN^ 


>— _ — _ 




f — SRI OOUT[7:0] 






SRI VALID > 


Hin VALlbREG1> 




ID.DSR1 

ID_DSR1 J 


CM 



or 
co 



FIG. 17c 



174 



J* 



EP 0 891 094 A2 



IN_DATA(6:0] 

OATA-TOK 

IO0KDEC 
T1_0UTDATAT0K 



HIN DATAOKQ 



IN EXTNREG2 



IN DREG2[21:0] 



IN VAL1DREG2 



m EXTNREG2 > 



ID_DSR2 

} 1 IN VAUDR£G2> »D_0SR2 



IN_DATA[6:0] 

DATA_TOK 



IN_QATA[6:0] 

DAT/LTOK 



HSRUDATAOK > 



BXJOKDEC 
TUOUTDATATOK 



EXTNJN 




EEXTN_OUT 


E0OUT[7:0] 


OIN[7:0] 


EVAUD_OUT 




EXTN_OUT 
DOUT[7:0] 


VALID_IN 





CM 

cn 

CO 



HSR2_EEXTN> 
-SR2JE0OUT[7:O] 

{SRO0O3> 

-SR2_EXTN 

- SR2_D0UT[7:0] 



MH DATA0K2> 



!N_DATA[6:0] 

OATA^TOK 



ID.TOKOEC 
T2JNDATAT0K 



ID_TOKDEC 
T2_0UTDATAT0K 



FIG. I7d 



•{srO^SQK> 



175 



EP 0 891 094 A2 



|T1_N0T_QATA> 
"&ATA> 



TEL 



^ 



TL0OUT[21:0] 



SR1_EXTN 



GND.GND.GND.CND.CND 
GND.GNO.GNO.GNO.GND A 
GND.CND.GNO.CND ~f 

SR1_DOUT[7:0] 



»-o 



1BUCVAUD1> 



IDCLENB 



SELDl 
SELD2 


CO 
UJ 

or 
1 




o 


extn 




doto 




valid 






enable 


00 



8 



8 



8 



HOUT_EXTN1 > 
- OUT D1[7:0]> 
HOUT^VAUDI > 



ID0PIF22KB 
T10PIF 



FIG. I7e 



176 



EP 0 891 094 A2 



GO 
I 

I 



CM 



A(U3:0] 
F[8:0] 



I00PMUX 
IDPPMUX 



T1 N 


ot data; 




ti.oata; 



SR2_EXTN 
T2_00UT[8:0] 



GND.SR2_DOUT[7:0] 
IBLK_VALID2> - 



<QUT ACCREQ2 



1 



cj cj co 



I. I 



on 



CO CO — 



extn 
doto 

volid 



accept 



(K) 



IDOPIF22KB 
T10P2F 



HOUT EXTN2> 
- OUT_02[7:0]> 

- iOUT VAUD2 > 



OUT ACCEPT2 1 



FIG. I7f 



177 



EP 0 891 094 A2 



0UT_CLK 
IN CLK 



T1_000[1:0]> 
T2_OOO[7:0l> - 



t_> c_> 

I . i 



T1_000[1:0] , NPUT 
T2_OOO[7:0] BUFFER 



CD 



CO 



10.0PIN 
10_0PIN 



in 



r~. ^ www 

^ -t 1 ^t 1 m 

mn 



INO 
INI 
IN4 
IN5 
IN2 
IN3 
IN6 
IN7 



21:0 
21:0 
21:0 
21:0 
21:0 
21:0 
[21:0 
21:0 



If) 



* ' 

5 



CO 

I 



CO 

I 

CM 



CO 

I 

GO 
I 

CM 



T_IN0[21:0) 



L 



231 



T_IN1[21:0] 


T_IN4[21:0] 


TJN5 


21:0] 


T_IN2 


21:0] 


T_IN3 


21:0] 


T_IN6 


21:0] 


T_IN7 


[21:0] 



T_IN0[21:0] 



INI 

_IN4 

T_IN5 
T_IN2 
TJN3 
T_IN6 
T IN7 



21:0 
21:0 
21:0 
21:0 
21:0 
21:0 
21:0 



3" MS[21:0]U 



PREC 



10_0P1ST 
10.0P1ST 





U-J 
CO 


CO 
Q_ 


Q_ 

& 


A 






Lj_I 


CO 


CO 


Q_ 


YP 




m 



P2[21:0] 
P1[21:0] 

P3[21:0] 



GO 
HO 
G1 
HI 
G3 
H3 
G2 



21:0 
21:0 
21:0 
21:0] 
21:0 
[21:0 
21:0 



H2[21:0 



c_> t_> 



CD 



CD 



P0STC 
PARTS 



T1_D0UT[21:0] 



bJ CD CD L*J CD 



CD CD 
CD =D 
«< CO 



CD CM 
I I 



I CD | 



• CD CM 

t I I 
CM CM 



CO 



LU CO CO CO CO 

I t f f I 

H-> fO 
CM CM "^T 

I I I I I 

CM " — " — CnI CM 



CO CO 

I I 

r-. co 

CD 

I — I — 

o o 



co co 
I I 

CO 

r-n cm 



o o 



"i"i"i"iT2JX)UT[21:0] 

r-^ to in 1 J 

CD « — ro CM 



O O CD CD 
III! 

CM CsJ CM CM 



10_0P0UT 
10_0P0UT 



o m qd m co cd m m i i i i j _j i i i i i j 



CO 



FIG. 18a 



I 

I-— 
CO 

o 
I 



CM 



CO 



CM 



CM 
I 

CM 



CM 



CO 



CM 



CO 

I 

I 



CO 

I 

r> 

CM 
I 

CM 



CO 



CM 



CO 



CD 



o 



CO 

I 

to 



CO 



CO 

.A 1 

CM 



CO 



O 



CO 



O 
I 

CM 



CO 



CM 



CO 

I 

ir> 

CM 



CM 



178 



EP 0 891 094 A2 



r 



232 



233 



MS[21:0] 



P2[21:0] 



P1[21:0] 



P3(21:0] 



MS[21:0] 


ST0[2 1 :0] 


P2[21:0] 


ST1[21:0] 

C8LK 

ST3[21:0] 


PI [21:0] 


P3[21:0] 


ST2[21:0] 



10.DP2ND 
10.DP2ND 



H2[21:0 



C2[21:0 



H3[21:0 



I 



ST0[21:0] 



ST1[21:0] 



ST3[21:0] 



ST2[2 1 :0] 



t — I 



ST0[21:0] s '_ 

ST1[21:0] pos ^ 

ST3[21:0] PART A 

ST2[21:0] 



GO 
HO 



CD CD 



G3 
H3 
G2 



i i 



21 
21 



GH21 
H1 



[21 
[21 
21 



[21 
H2[21:0] 



0 
0 
0 

o] 
o] 
o] 
o] 



I0.DP3RD 







CD 


m 






1 


1 






OD 


CJ) 



C3[21:0 



H1[21:0 



G1[21:0 



H0[21:0 



G0[21:0 



RAS_IN[21:0] 



RAS_IN[21:0] 

RAS_OUT[21:0] 



RAS_0UT[21:0] 



10_0PRAS 
10_0PRAS 



<_> CJ 

I ■ I 
OUTPUT 



BUFFER 
BOFF_IN[0:0] 0UTL[143:0] 



CD 



10_OP10FF 
10.0P20FF 



I 



T1_0UT[21:0] ^ > 



T2.0UT[H3:0]^> 



FIG. 18b 



C-> 
CD 



179 




EP 0 891 094 A2 



OUT CLK 
UN CLK 

ISEL DATA 
11 IN[ 11.0] 

T7Wr55 T, - NA , N i D[,1:0] 




=1 



^AN0[l1:0]fJ1_IN0[l1:0] 



T1_INV[1 1:0] 



LATCH2[0] - 



UTCH2[1] 



LATCH2[4] 



LATCH2[5] 



LATCH2[2] 



LATCH2[3] 



LATCH2[6] 



LATCH2[7] - 
I 



0 

3CK k 



LATCH2[8] 



T2_IN0L[21:0] 



0 

C1CK k 



LATCH2[9] 



T2_IN1L[21:0] 



0 

I1CK k 



UTCH2[12] 



0 

tJCK k 



J 



T2_IN8L[21:0] 



T2_IN4L[21:0] 



D 

t(CK k 



LATCH2[13] 



T2_IN5L[21:0] 



D 

dCK k 



LATCH2[10] 



T2_IN2L[21:0] 



0 

t|CK k 



LATCH2[11] 



T2_IN3L(21:0) 



D 



=tCK k 



LATCH2[14] 



T2_IN6L[21:0] 



0 

tJCK k 



J 



LATCH2[15] 



=CCK k 



D 

HCK k 



lT 



T2_IN9L[21:0] 



D 



J 



T2_IN12L[21:0] 



0 

t jCK k 



J 



T2_IN13L[21:0] 



0 

t(CK k 



J 



T2_IN10L[21:0] 



D 



J 



=CCK k 



T2_IN11L[21:0] 



D 

HCK k 



J 



T2_IN14L(21:0) 



D 



J 



«CK k 



-| T2_IN7L[21:0] 



T2_IN15L[21:0] 



FIG. 19a 



i 

L 



T2 LATCHES[0-15] 



UTCH2[0] 



LATCH2[1] 



LATCH2[4] 



UTCH2[5] 



LATCH2[2] 



LATCH2[3] 



LATCH2(6) 



=TCK k 



LATCH2[7] — tqCK_k 



CCK k 



1h 



T1_INOL[11:0] 



D 



"L, 



T1_IN1L[1 1:0] 



D 

acK k 



T1_IN4L[11:0] 



0 

flCK k 




T1_IN5L[11:0] 



0 

acK k 



i-, 




T1_IN2L[11:0] 



D 

3CK k 



T1_IN3L[1 1:0] 



D 



=£CK k 



1— , 



T1_IN6L[1 1:0] 




T1_IN7L[1 1:0] 
T1 LATCHES[0-7] 



180 



4 



EP 0 891 094 A2 



T2_IN0[21:0] 



T2_IN8[21:0] 



T2_IN9[21:0] 



T2_IN12[21:0l 



T_IN4L[11:0] T1_IN4[1 1.0] -v 

GND.GND.GND.GNO.GND 4 
GND.GND.GND.GNO.GND J 

T2_IN5[21:0] 



T2_IN13f21:0l 



T_IN5L[11:0] T1_IN5[1 1.0] -» 
GND.GND.GND.GNO.GND 4 
GND.GND.GND.GNO.GND J 

T2_IN2[21:0] 



1 



T2_IN10[21:0] 



T_IN2L[11:0] T1_IN2[1 1.0] a 
GND.GND.GND.GNO.GND 4 
GND.GND.GND.GND.GND J 

T2_IN3[21jO] 



T2_IN11[21:0] 



T_IN3L[11:0] T1_IN3[1 1.0] ^ 

GND.GND.GND.GNO.GND 4 
GND.GND.GND.GND.GND J 

T2_IN6[21:0] 



I 



T2JN14f21:0] 



T_IN6L[1 1:0] T1_IN6[1 1.0] A 
GND.GND.GND.GNO.GND 4 
GND.GND.GND.GNO.GND J 

T2_IN7[21:0] 



T2_IN15[21:0] 



T_IN7L[11:0] T1_IN7[11.0] 



)\ I l_IN/[ I I .UJ "\ 

GND.GND.GND.GNO.GND 4 
GND.GND.GND.GNO.GND J 



tn 



I 

CD 



CM 



T_IN0L[11:0] T1_IN0[11.0K ff 
GND.GND.GND.GNO.GND 4~ * J 
GND.GND.GND.GNO.GND J 

T2_IN1[21:0] 



T_IN1L[1 1:0] T1_IN1[11.0] -y If 
GNO.GND.GND.GND.GND f— ' ! 
GND.GND.GND.GNO.GND J 

T2_IN4[21:0] 



tn 

cvi 



LATCH 1 [7:0] >f LATCH 1 [7:0] 



LATCH2[15:0]> +- LATCH2[15:0] 

FIG. 19b 

B 



CK k 



T_IN0B[21:O] 



1 



T_IN0L[21:0] 



CK k 



T_IN1B[21:0] 



T_IN1L[21:0] 



CK k 



T_IN4B[21:0] 



T_IN4L[21:0] 



CK k 



T_IN5B[21:0] 



T_IN5L[21:0] 



CK k 



T_IN2B[21:0] 



T_IN2L[21:0] 



CK k 



T_IN3B[21:0] 



T_IN3L[21:0] 



CK k 



T_IN6B[21:0] 



T_IN6L[21:0] 



CK k 



T_IN7B[21:0] 



T_IN7L[21:0] 



lT_IN0[21:0] > 



_INV0[21:0] 



1T_IN1[21:0]> 



T_INV1[21:0] 



k> lT_IN4[21:0] > 



T_INV4[21:0) 



k> [T_IN5[21:0]> 

T_INV5[21:0] 



k> 1TJN2[21:0]> 



T_INV2[21:0] 



1TJN3[21:0] > 



T_INV3[21:0] 



k> |T_IN6[21:0] > 



T_INV6[21:0] 



k> |T_IN7[21:0]> 

T_INV7[21:0] 



DATAPATH SLAVELATCHES 



~1 



_J 



181 



EPO 891 094 A2 



|l IN0[21:0] > 




ONRAOD 
PJA 



FIG. 20a 



182 



EP 0 891 094 A2 



||N CIK 


> 




foUT CIK 


> 



i 



COIF 21:0] 




C45F[21:0] 



M512T.0] ) 



\ P2121:0) ) 



NPU21] PU2V.01 — rPlT2U0r> 
PI [20:0] 

C23F[21:0] g 

NPl[2ll— {k>>— (k>— PU21] 

P1_21 P1_21BUFF[1:0] 



^ NP3I21] 
5 ~\- P3(20:0] 



C67F[21:0] 



P3[21:0] 



B 



MP31211— ^>o — ^>>-P3l211 
P3.21 P3_21BUFF[1:0] 



FIG. 20b 



183 



EP 0 891 094 A2 



M5[21],M5[2!],M5{212] 
(SR2 ANO SIGN-EXTEND] 
I M5f21:0) ) M5(2l.-0] 



I 25 
M5[J]-(NT| 

I 24 
M5[0]— |NC] 



P2[21:0] > 



P2[21J.P2[21],P2[21:2] 
[SR2 AND SIGN- EXTEND] 

P2[21:0] 



Nt[21:01 



NIN1SR2[21.-0] 



I 23 
P2[1]-[NC] 

I 22 
P2[0]— [NC] 

PI[2I].PI[21:1> 

[SRI AND SIGN -EXTEND] 

Pl|21:0] > 




P1[2I:0] 



I 21 
PlfO] — |NC] 



P3[21].P3[21:1> 
[SRI AND SIGN- EXTEND] 

P3[21:0] ) 



P3[21:0] 



I 20 
P3[0]— (NC) 



U1CJ21.-0] 



M1S[20:0J 



DMULC1S 
IDMULC1S 




IDMULC3S 
IDMULC3S3 



M3C[20:0] 
M3S[19:0] 




IDMULC3S 
IDMULC3S2 



M2C[20:0] 
M2S[19:0] 




M4Cf 21:01 



I 



GND 
M1S[20:0 

P3[21] 
M2C[20.0] 
GND.GND 
M2S[19.0] 




IDCSADD 
BT3 




J 



— cj s> M4Sf21:0) [ 

IDMULNC1S 
I0MULNC1S 



P1{21] 
M3C[20:0] 
GND.GND -V->o 
M3S[19.-0]" J ^V|_^ : 

b sc 
b 



T3S[21:0] 




T2S[21:0] 



IDCSADD 
BT2 



T2C[21:1] 



FIG. 21a 



184 



EP 0 891 094 A2 



T3C[21:1 
CND 




OCSADD 
CSO 



O0S[21:0) 



I 



-O0C[21:1] 

00C[21:1 
CND 




IDNFRADDNSB 
RESO 



-BT0[21:0] > 



1 19 



NC 



T2C[21:1 
GND 




DCSADO 
CS1 



01S[21:0] 



f 

-01C[21:l] 
01C[21:1 



GND 




IDNFRADDNSB 
RES1 



^T1[21:0J > 



1 18 



NC 



T3C[21:1 
CND 




DCSSUB 
CS3 



03S[21:0] 



03C[21:1] 

03C[21:1R_ 
VDO^ 




IDNFRADDNSB 
RES3 



BT3[21:0] > 



1 17 



NC 



T2C[21:1 
' GND 




SWAPPED! 



DCSSUB 
CS2 



O2S[21:0] 



-02C[21:1] 
02C[21:lK_| b 
VDO^i 





IDNFRADDNSB 
RES2 



BT2[21:0] > 



1 16 



NC 



FIG. 21b 



185 



EP 0 891 094 A2 



H0C(21:0] 
HOS(21:0] 



H1C[21:0] 
H1S[21:0] 




H3S[2Q:0) 



H2S(20:0] 



BT2F{20:0] 



IDMULOS 
OSUUL 



186 



EP 0 891 094 A2 



H0C[21:1].V0D 



H1C(21:1].VDD 



GN0,H1S[20:2], 
VOD.H1S10) 



H3Cl21:0] 



GNO,H3S120:0] 



GND.H2S[20:0l 



I 68 

H2R[21] OS 



FIG, 22b 



lCQ[21:0j > 



$wrm > 




{eSS3i> 



msam 



187 




188 



i 



EP 0 891 094 A2 




189 , 



EP 0 891 094 A2 




190 



EP 0 891 094 A2 




191 



EP 0 891 094 A2 




192 



EP 0 891 094 A2 




0UTL[71:63] 



ID.DFE 
0UT3[8:0] 



ID_DFE 
0UT7[B:0] 



FIG. 25a 



193 



EP 0 891 094 A2 





0UTI(143:35J 



KLDFE 
0UT1 1(8:0] 



ID_DFE 
OUT15[8:0] 



0UTI[ 143:0] 



oun[i43.oij> FIG. 25b 



194 



EP 0 891 094 A2 



CSR M(3] 



CSR[2] 



CSR M[2] 



CSR M[1] 




CSR M[0] 



u> 



o 

CO 
<M 

O 



195 



EP 0 891 094 A2 



CO 



CSR U[7] 



CSR[6] 



CSR M[6] 



CSR[5] 



CSR M[5J 



CSR[4] 



CSR M[4] 



CSR[3] 













3 <_> 




CM 





o c_> 



CSR[14J 



CSR M[H] 



CSR[13] 



CSR M[13J 



CSR[12] 



CSR M[12] 



CSR[11] 



or 
to 
















ir> 






1 

to 
c_> 








to 



CO CO 

c_> c_> 



CD 
CVI 



196 



CSR1_M[15:0T^ 
| CSR2_M[31 :6\y 



EP 0 891 094 A2 



— CSR1_M[15:0] 

— CSR2_M[31:0] 



T1_IN\_SEL ~> 



T1_23_SEL > 



\ T1_45_SEL > 

- T1_0UT07_SEL > 

- T1_0UT16_SEL > 

- T1_0UT34_SEl > 

- T1_0UT25_SEL > 



CSR1_M|0] 





CSR1_M[13] 
CSR1_M[11] 



CSR1_M[8] 
CSR1_M[15] 

CSR1_M[9] 
CSR1_M[14] 

CSR1_M[11] 
CSR1_M[12] 

CSR1_M[10] 
CSR1_M[13] 




1_545 




1_632 




1_631 




1.630 




1_548 



1_629 




FIG. 27a 



197 



» 



EP 0 891 094 A2 



CSR1_M[7]— 



1_663 
NC 



CSR2_M[31] — 



1_656 
NC 



1_662 



CSR1_M[6] — 



1_655 



NC 



1_661 



CSR1_M[5]— 



NC 



CSR2_M[30] 



CSR2_M[29] 



NC 



1_654 



NC 



1_660 



1_653 



CSR1_M[4]— NC CSR2_M[28] 



CSR1_M[3]— 



CSR1_M[2]— 



CSR1_M[1]— 



1.659 
NC 

1_658 
NC 

1_657 
NC 



CSR2_M[27] — 



CSR2_M[26] 



CSR2_M[25] — 



CSR2_M[24] 



NC 

1_652 
NC 

1.651 
NC 

1_650 
NC 

1.649 
NC 



1.648 
CSR2_M[23) — ( NC 



1.647 



CSR2_M[22] — 



NC 



1.646 



CSR2_M[21] 



CSR2_M[20) 



NC 



1.645 



CSR2_M[19] — 



CSR2_M[18] — 



CSR2_M[17] — 



CSR2_M[16] — 



NC 

1.644 
NC 

1.643 
NC 

1.642 
NC 

1.641 
NC 



CSR2_M[7] 



CSR2_M[6] — 



CSR2_M[5] 



CSR2_M[4] 



CSR2_M[3] 



CSR2_M[2] 



CSR2_M[1] 



1.640 
NC 

1.639 
NC 

1_638 
NC 

1.637 
NC 



1.636 
NC 

1.635 
NC 

1 .63 4 
NC 



FIG. 27b 



198 



EP 0 891 094 A2 



T2_IN_SEL[1:0]> 



T2_IN_SEL(0] 



> 



T2JM_SEL[1] 




CSR2_M[0] 





T2.IN/LSEL > 



T2_INB_SEL y 



CSR1_M[10] 
CSR2_M[11] 



CSR2_M[8] - 
CSR2_M[15] 

CSR2_M[9] 
CSR2_M[14] 

CSR2_M[11] 
CSR2_M[12] 

CSR2_M[10] 
CSR2_M[13] 





1_628 




1.627 




1_626 




1_553 



\ T2.23.SEL > 



\ T2_45_SEL > 



1.625 




T2_OUT07_SE1~> 



T2_0UT34_SEl~> 



T2_0UT25_SEL~~> 



FIG. 27c 



199. 



EP 0 891 094 A2 




200 



EP0 891 094 A2 




201 



EP 0 891 094 A2 




FIG. 28c 



202 



EP 0 891 094 A2 



1_725 
CSR[7] - fNC] 

1_724 
CSR[6] -[NC] 

1_723 

CSR[5] — [NC] 
1_722 
CSR[4] — fNCl 

Cm 

CSR[3] — [NC~l 
1_720 
CSR[2] -[NC 



1_719 
CSR[1] -fNC] 
1_718 

csr[o] -|nc] 



1_749 
CSR[14] -[NC 



CSR2[31:0]> 



1_74B 

csr[i3] - [nc| 

1-717 

CSR[12] — fNC] 
1_7T6 

csr[ii] - [nc] 

1_715 

csr[io] — [nc] 

1_7H 
CSR[9] - fNC] 

1_713 
CSR[8) -fNC] 

CSR2[31:0] 



T2_D 
T2_D 
T2_0. 
T2_0. 

T2_0. 
T2_D. 
T2_0 



LSEL 



_SEL[2 



SEL 
.SEL 

SEU 
.SEL 
.SEL 



7) 

6] 

[5] 



CSR2[31] 
CSR2(23] 

CSR2[30] 
CSR2[22] 

CSR2[29] 
CSR2[21] 

CSR2[28] 
CSR2[20] 



1_561 
L 



1_537 



1_536 



1_533 



1_675 



1_674 



1_673 



>1 



1_672 



FIG. 29a-l 



203 



• • » 

EP 0 891 094 A2 



CSR2[27] 
CSR2[18] 

CSR2[26] 
CSR2f 1 7] 

CSR2[25] 
CSR2[16] 

CSR2[24] 
CSR2[15] 

CSR2(23J 
CSR2(14] 




1_524 



OSLIP 



1.667 



FIG. 29a-2 



204 



EP 0 891 094 A2 



RECOVER 



IDCT.ENB 
DENB 



DSUP 



1_733 



1_754 



BCNT 
BCNTf2" 



BCNT 1 
BCNT 



■ i 



T2_SCNTt3 
T2.SCNT 
T2_SCNT 
T2.SCNT 



EBSUP 



=3> 



1_683 



BENB 




1_767 



DSUP 



RECOVER- 



1.770 



1_776 



OS 



BS 



RS 



FIG. 29b 



205 



EP0 891 094 A2 



■ 




FIG. 29c 

206 



EP0 891 094 A2 



I — I 



T2_D_SEL[7:0) 




FIG. 29d 



-j T2_0-SEL[7:0l> 




T2_D_SEL[7] 



T2_D_SEL[6] 



1_700 1-708 
C 




T2_D_SEL(5] 



~^y> ^>o— T2_0_SEL[4] 



1_710 1-706 
C 



T2_D_SEU3] 




z>^-T2_0_SEUl] 



1_697 1_703 
C 




T2_0_SELlO] 



1_696 1-702 



207 



EP0 891 094 A2 




NRSLIP 



RECOVER 



DENB 



FIG. 29e 



208 



EP 0 891 094 A2 





209 



EP 0 891 094 A2 



I IN DCNT1 FNR 




NXIDC[5:0] 



IDZINCGE 
ICA00 



NXTDEFERR > 




NIDC[5:01 



SUPLTCH 



to 
o 

CD 



^r— — 



CD 



CD 
CD 



CD 



CD 
CD 



CD 
CD 




DEFERR > 



L274 



L273 



i SUPERR > 



FIG. 30a 



210 



EP 0 891 094 A2 



CEMiMCT> 



IDCT.ENB 



I IDCT STALL > 



STANDARD DATA TOKEN DECODER. 

WILL RECOGNISE DATA.NONE AND OTHER TOKENS 
CODING:- 



token[l] 


token[0] 




0 
0 
1 
1 


0 
1 

0 
1 


NONE 
OTHER 
DATA 
illegal 





- I NOT IT0K1f1:0] ^ 




IN TOKfl:0l > 



FIG. 30c 



212 



EP 0 891 094 A2 



Ai 



CD 




O 

ro 



213 




214 



EP 0 891 094 A2 




NOT.RESET 
0UT_CLK 

ENABLE 
IN_CLK 



55§ ^5 




CM 



Csl <N 



to CO 
Q <3 



AS 



A 



AT 3 



A, 

o 



CM 

CO 




CM 
«X> 
CM 
I 



A 



LO 
CM 



J to 

O 

Ll. 



CM 

O 



CM 



CM 
CO 



215 



EP 0 891 094 A2 



STANDARD DATA TELEN DECODER. 

WILL RECOGNIZE DATA, NONE, AND OTHER TELENO 
CODING > 



telen[l] 


telen[0] 




0 
0 
1 
1 


0 
0 
1 
1 


NONE 
OTHER 
DATA 
illegol 



IN_T0K2[1J 
IN_T0K2[0J 

NEXT_0T0K2[1] 




NEXT_OTOK2[0] 1 '- zo 

FIG. 31c 



216 • 



EP 0 891 094 A2 




218 . 



EP0 891 094 A2 



*t o* o <t 



t 
© 

c 



9 ?taiaciate,t. 

u cxrr £££££ 

A «4 ♦> 4-*— UUUUUUWO 

a a c o> jc aA0Adit*)tf 

333 (PLLLLLLLl 



n " 

0 M 
C 
— ♦* 
I 3 
O 



O L 

.. « 

CM C 



L> t> U U O U UUSJUlZi-^*-- 1 -*- 




CVJ 

ro 



O 

Li. 



219 



• * 

EP 0 891 094 A2 




ro 
ro 




220 ■ 



EPO 891 094 A2 



«h c« r> * w *> r- ors 



" nnnnnnnnn^n 



Ok * 0 0 0 0*00 £ 

VV*- <OO 00 0 0 0 00*f 
ADA ^r^vv^r^-v^T'V 



8 

a o 



c 
♦* 
o 
c 



UI 



15 

•4 O 

*» x> 

• I 



u M u v y y « w u u y f f * f f f r;_5£ ^ 2 



a 3 D- 

so m o i 

0 |PO U " 



o > 
O O M 

«4 «4 n 

TO U O 

• • I 



0 3 

W 

• * I 



— n 

C 0 
0 O 



OOQOO 



oooooooooqoooSooooo 



X _J -I J-i-J 

oooooo 



z z 

QO 



gg 



z 
o 




c 



Si: 



1.1 

I 

■J 

1 

1.1 

8 

I- 

i:. 

«.J 

i 

IJ 

8 

0 

o 

8 

».) 

n 

i 

0 
0 

o 

0 

s 

1:1 

H 



u u 



8 



I 

0 
0 

o 



to 



1 1 



y 

0 
«.l 

H 
o 

0 

0 
0 

0 

I.I 

0 

ft 

y 



o 
o 
o 



10 



© 

G 

r> 

•fx 
r> 



P 



ID 

r» 

*r» 
r» 

M 
8 

in 
n 



o 
n 

n 
«0 




221 



EP 0 891 094 A2 



m q « ai 



■ c 

B 4» 

JC JC O 

a. a c 
» • • 

AAA 

-ix r 



^££C£££4CC 0 

P o-< 

J J J J J J Jjlj" 
OOOQOOOQQO 



. _ . . —r~ — — — — - " ^ «4 «4 

^WNMNWNNNNNUuuuuun 1 
' "WWNNNNCrf 



p M «P ♦>♦*♦» 4» 4» *> 2 +» J> 

o 0-22*222 5 * * 

•* (LL.LLLLLLLL 



r. xlcc c^— c 

u u u u u u CM — 
0 0 0 0 0 0 O N 

••••••CI* 



nn 
com 

« ~ to 

JO LJ 

£$? 

I Q ~f 

•WD 
0 ♦* O 



c 

I ID 

to 

j • 

In 



0 

y 

r 1 



y 

H 

i'V 



000000000000000000 



3 



Jl I 
»t »t 

1 If 1 

.-he 1 
Ol ) 

>iV 
Or j 

•1 

WW 

W»' 



>f 



1 

M 



!-T 1 
W>« 



a. 3 5 — 

E a n n o « 

• IV A u • 

I CM - • «> « 

♦* *t>B • I 
* «l I I W£ 
*l "O £ JC U u 

r ♦> 4> jc - 



_ -1 X 

000000 o 



u 




222 



EPO 891 094 A2 




*> 

L. 
f> 

3 

c 



8 ?S 



r £ 
a a 



l 

o 
c 



3 3 3 



O) o 
o — 

t_ • 

1 L 



£. C C C 
u u u o 
+> *J ♦> 

0 0 0 * 



© c * © 

L L L L 



ESE 



C £ C 
u u u 

0 0 0 



O O €» 
ILL 



*> — 
l_ O 

© n 

0 I 
C 0 

— C 
CM — 
* I 

I** 



CM 

t-J 

3 
O 
13 



- - I 



» s cm r> ^ in to S cos w 

*-JuuuuuuuuuuCCCCCC 



LOW 
d n n 
n t I 

c cn 

- c c 

N 

X I t 

O NN 
|4» ♦> 
W - " 
♦> V V 



nn 

sin 



3* 
O 

!♦» 
M 3 
♦* O 



2:°* 

a 3 — - 
e 0 n n 

© | X> 0 
|c* • * 
M ♦» 9i n 
PUD 

-nil 
nu.cc 

!£♦>*> 



l. a 
v — 

3 — 

o n 

w * 
« *» 

• I 

n c 
u u 
I ** 
c - 



O 3 
O O 
I I 

T5 X) 

* • 



n r-i nC 

»• n 

0 3 «4 C • 

W W Li | 

U II*-* « 4> 

O NN X) P 3 

O TJ Tl — ' 0 O 

I • • 1 TJ ' 
I *- 

«. -o *> 

3 3 - 3 3 

xj -o ▼ u 



3 3 
J3 0 



O D £> -w T *w 



I J I JXJJJJJJJJ 

ooo oooooooooo 



OOOOOOOOOOOOODOOOO 



I X -J J -J -I 
O D O O O O 



X X 
o o 




J J 



r 



i 



J JL 



X X 

o o 



I i 

0 
0 

II 

i: 
».:' 

»< 

9 

i i 

tt 

H 

1:1 

1.1 

i 

I I 
» I 

1:! 

* : 



X 

o 



II 



Li:,, 



••t'l 



CO 

ro 



Lt_ 



223 



EP 0 891 094 A2 




224 



EP 0 891 094 A2 




225 



EP 0 891 094 A2 



CLOCK REFERENCES 



MPEG 
SYSTEMS 
STREAM 



SYSTEM 
DEMUX. 




• DATA 
► TIME STAMP 

• OATA 

• TIME STAMP 



251 

| STREAM 1 
| STREAM 2 



DATA 

TIME STAMP 



ELEMENTARY STREAMS 
EACH WITH ASSOCIATED 
TIMESTAMPS 



250 



j STREAM n 



FIG. 39 



226 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



1 



SYSTEM 
DEMUX 



▼ t 

OTHER 
ELEMENTARY 
STREAMS 
AND THEIR 
TIME 
STAMPS 



cn 



3 

En 



<7> 



253 

CLOCK REFERENCES 












TIME \ <- 



90kH2 



"TIMr 

INITIALISED 
FROM CLOCK 
REFERENCE 



SYSTEM DECODER 



27MH? 



256 



ELEMENTARY 
STREAM 
BUFFERING 



260 



ELEMENTARY STREAM 
DECODER 




257 



EARLY/LATE 



CORRECTION FOR NON-ZERO DECODE TIME 
TO ELEMENTARY STREAM DECODING 



FIG. 40 



227. 



BP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



253 

CLOCK REFERENCES 




? f 

OTHER 
ELEMENTARY 
STREAMS 
AND THDR 
TIME 
STAMPS 



ELEMENTARY 
STREAM 
BUFFERING 



on 
m 



CO 



CO 



"TIME" 

INITIALISED 
FROM CLOCK 
REFERENCE 



TIME ' < 




^-262 





6Zf 



90kHz 



es_lime 



300 < 




rcseLtime 



260 



SYS TEM_[)E CO DE R_ 
ELEMENTARY STREAM 
DECODER 



es_time 



OPTIONALLY RESET THE 
ELEMENTARY STREAM 
DECODER COPY OF 
ES.TIME EACH TIME THIS 
VERSION RETURNS TO 
ZERO. TO AVOID ANY 
POSSIBLE (UNEXPECTED) 
DRIFT OVER VERY LONG 
TIME PERIODS. 





*300 < 



257 



CORRECTION FOR NON-ZERO DECODE TIME 
► TO ELEMENTARY STREAM DECODING 



FIG. 41 



228 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



253 

CLOCK REFERENCES 




OTHER 
ELEMENTARY 
STREAMS 
AND THEIR 
TIME 
STAMPS 



"TIME* 

INITIALISED 
FROM CLOCK 
REFERENCE 



^ 055 90kHz 



300 < 



ELEMENTARY 
STREAM 
BUFFERING 



27MHz 



reseLtime 



LOAD INITAL_TIME 

WHEN RESET_TIME 
ASSERTED 



SYSTEM DECODER 



ELEMENTARY STREAM 
DECOOER 




256 



257 



CORRECTION FOR NON-ZERO DECODE TIME 

»► TO ELEMENTARY STREAM DECODING 



FIG. 42 



229 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



253 




CLOCK REFERENCES 



CLOCK 
RECOVERY 




"TIME" 

INITIALISED 
FROM CLOCK 
REFERENCE 



SYSTEM DECODER 



27MHz 



256 



VIDEO 
DECODING 
BUFER 



VIDEO 
DECODER 



t300 



270 



EARLY/LATE 

CORRECTION FOR NON-ZERO DECODE TIME 
TO VIDEO DECODING 



FIG. 43 



230 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



253 

s> 

CLOCK REFERENCES 




27MHz 



\ T 

AUDIO 
DATA AND ITS 
TIME 
STAMPS 



SYSTEMOECODER 
VIDEO STREAM " 
DECODER 



VIDEO 
DECODING 
BUFER 



OPTIONALLY RESET THE 
ELEMENTARY STREAM 
DECODER COPY OF 
ES.TIME EACH TIME THIS 
VERSION RETURNS TO 
ZERO. TO AVOID ANY 
POSSIBLE (UNEXPECTED) 
DRIFT OVER VERY LONG 
TIME PERIODS. 



271 




EARLY/LATE 



270 



CORRECTION FOR NON-ZERO DECODE TIME 
TO ELEMENTARY VIDEO DECODING 



256 



FIG. 44 



231 s 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



27.027MHz 




CORRECTION FOR NON-ZERO DECODE TIME 
TO ELEMENTARY VIDEO DECODING 



FIG. 45 



232 



EP 0 891 094 A2 



TOKEN INTERFACE 



MICROPROCESSOR INTERFACE 




FIG. 46 



233: 



EP 0 891 094 A2 




FIG. 47 



234 



EPO 891 094 A2 



I 



WRITE CONTROL CIRCUITRY 



WRITE 




I 



WRITE STROBES j WRIT E ADDRESS 

8 



1 



SWING BUFFER 



14 



SWING BUFFER 



READ DATA 



1 



10 



12 



READ STROBES 



READ ADDRESS 



8 




READ CONTROL CIRCUITRY 



SWING UNIT 



FIG. 48 



235 



EP 0 891 034 A2 




CM 



236 



EP 0 891 094 A2 



DATA IN 



34 



T 



WRITE 
ADDRESS 



WRITE CIRCUITRY 

• : -:-\'\ 1 r 



32 



\ I 



• ■ j * 



i- r 



• » 



§ 



CO 



•• • (• I- 

• • • 

* * i 

* * • I 

• • ■ i • 
•I. 



* 1 - I 



READ CIRCUITRY 



READ 
ADDRESS 



J 



36 



DATA OUT 



FIG. 50 



237 r , 



EP 0 891 094 A2 




EP 0 891 094 A2 



INPUT 




TABLE 
SELECT 



SELECTOR 




'28 

' v > 





FIG. 52 



239 



EP 0 891 094 A2 




240 



EP 0 891 094 A2 



o 

m « 



to 

8 




CO 

o 
o 



— » I » » 

too 
o 




CM 



O 

• • 1 1 1 

CS tO 



CD , 



2k: c-?T°. 



o 



u o u q: 



Q£ > 

3 =s cj> cJ ^=tr— iLoxjG 
o5f=-OLOLj(/)u- 




o 



Jo 1 
<_> o 



o o 

• • • • 

O UJ 

3 r>j , — 
O io g 



I » ■ >ii i 1 ■ - 
CD CD CD X O 

< in ^- u 



P5 



o o 



Of 

I 



u uj y: ^ _ 
co * — 1 — fvi to 



X 



I 



CO 



o 

CO 





5*r 












> O 



3 



si 



o 



r 













be 







1 



* • 

I — 

CM 

O 
CO 



1 



ro 
in 



O 

CO 



241 



EP 0 891 094 A2 




242 



EP 0 891 094 A2 



LSREC[31:0] 



SEL0UT[27:12 
CMDREG[3:0; 



IN CLK> 



OUT ACC REG 



<DUT ACC RECh 




NXVSL(31:0] 



VLC[15:0] 




DATA(15.0] 


TABLE[3:0] 






IN.CLK 






OUT.CLK 
ENABLE 


VLC 

DECOOER 
ROM 


' UPRSTR 

■ UPWSTR 

■ LU0ATA|7:O] 
. UPISEL[1:0] 



PHROM 
PHROM 



UPRSTR. 



UPWSTR 

<fLU0ATA[7:O 
> — i — - 



ITT 



UPISEL 



1:0 



OUT.VALIO 





VLC_SIZE[4:0] 
R0M_RCN[4:0] 
0EC_VLC[5:0] 



< $UT ACCEPT 



<jFTCLK 



FIG. 53d 



243 



EP 0 891 094 A2 



GO 

JEJTL 



EJTU 



pooo 

53 §S 



old 



CM 



ri_ri 



cn 



to 
cn 



=J"1 



2! 5 



A0DR[5:0)> 



CD 

^ ^ 



OJT 



oooo 
oUo'o 



i i 



to 



CNI 



9 




DECODE 

'1222' 



DECODE 
•0112* 



DECODE 

•ooir 



DECODE 
'0010' 



SWVDD 




ADDEC 


WUNE 


IN.CLK 





R0WDRV3 



SWVDD 
AD0EC 
IfLCLX 



WUNE 



RBROWDRV 
R0WDRV2 



SWVDD 
ADDEC 
IN.CLX 



WUNE 



RBROWDRV 
R0WDRV1 



SWVDO 
ADDEC 
IN_CLK 



WUNE 



RBROWDRV 
R0WDRV0 



ADDR(0] 
ADDRtU 



RBC0LAD4 
1_1B9 



FIG. 54a 




rS 




244 



EP0 891 094 A2 




lO CM 

SELO E£i 
SEL1 
SEL2 
SEL3 

IN_CLK_B 
0UT_CLK 
ENBL 
UPlW 

UPSEL 
UPIR 



RBCMUX4 
L188 



A. 



STR 
SEL 



RBUPISTR 
UPISTRO 





SELO s 5E 5: 5: 

SEL1 
SEL2 
SEL3 

IN_CLK_B 
0UT_CLK 
ENBL 
UPlW 

UPSEL % 
UPIR & ^ 





























RBCMUX4 
LI 87 



O- 



H PATAU :Q) > 
-[NUP^njl> 



FIG. 54b 



245 



EP 0 891 094 A2 




246 




247 



EP 0 891 094 A2 



(ADDRESS LINES) 




N0R- 
BASED 
WORDLINE 
DRIVER 



11 xx 



NOR- 
8ASE0 
WORDLINE 
DRIVER 



01xx 

D- 



NOR- 
BASED 
WORDLINE 
DRIVER 



D01x 



o— 



NOR- 
BASED 
WORDLINE 
DRIVER 



0001 



NOR- 
BASED 
WOROLINE 
DRIVER 



0000 



O- 



FIG. 57 



248 



EP 0 891 094 A2 




249 



EP 0 891 094 A2 




250 



EP 0 891 094 A2 




251 



EP 0 891 094 A2 



325 

I 



326 



327 



328 



IMODEL 




I2Z 




IQUANT 












IDCT 



FIG. 61 



CONFIGURE 
PIN 



210 

? 


C VERT 




UPSMP 






V 




OUTPUT 
MUX 









TRIGGER SAV/EAV 



CONFIG 
TO PARSER 



^ Y 



Cr/Cb OR 
Y/Cr/Cb 



332 



VTG 



SYNC'S ETC 



333 



FIG. 62 



252 



EP 0 891 094 A2 



254 



■ — r 



AUDIO TS 



MPEG 
SYSTEMS 
STREAM 



SYSTEM 
OEMUX 



AUDIO DATA 



VIDEO 

DECODER 

BUFFER 



271 



253 




CLOCK REFERENCES 




SYSTEMS DECODER 



VIDEO DECODER 




CORRECTION 
FOR NON-ZERO 
DECODE TIME 



90 kHz 



256 



270 



EARLY/LATE 



TO VIDEO DECODE 



FIG. 63 



253 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



VIDEO 

DECODER 

BUFFER 



CORRECTION 
FOR NON-ZERO 
DECODE TIME 



90 kHz 




256 



RESET THE VIDEO 
DECODER'S VERSION 
EACH TIME WE 
RETURN TO ZERO 



CORRECTION COULD 
MODIFY TIME- 
STAMPS INSTEAD OF 
CLOCK REFERENCES 



EARLY/LATE 



TO VIDEO DECODE 



FIG. 64 



254 



EP 0 891 094 A2 



320 

/ 

UPI 



201 



FROM SCO 




TIME_ STAMP 



PICTURE^ START 



4804.8/4800 




ts_high 



ts_high 



MICROPROGRAM MABLE 
STATE MACHINE 




ts_low 



ts_Jow 




218 



FIG. 65 



255 . 



EP 0 891 094 A2 




AUXILIARY ADDRESS 



FIG. 66 



^ EVALUATED CONDITION 



INSTRUCTION 



DATA IN 




r 



219 



! < 



I 



: TOKEN PORT(INPUT) 



< 



4> 



-K 





H(CARRY BIT) 



4" 

T r 



ALU T FLAGS 



222 



REG. FILE 



FIG. 67 



M 



221 



A REG. 



A 



B REG. 



► B 



STATUS REG. 



H 



REGISTER FILE 



256 



EP 0 891 094 A2 



L J 



H 

_J 

CARRY BLOCK 



CONDITION BLOCK 



J" 



ALU CORE 



▼ t 

F FLAGS 



ii 



C* 



FIG. 68 



REG. FILE 



A REG. 



B REG. 



STATUS REG. 



M 



-*» B 



H 



FIG. 69 




FIG. 70 



257 



EP 0 891 094 A2 




i ■ 



FIG. 71 



258 



EP 0 891 094 A2 




FIG. 72 



259 



EP 0 891 094 A2 




FIG. 73 



260 



EP 0 891 094 A2 




FIG. 74 



261 



EP 0 891 094 A2 




FIG. 75 



262 



EP 0 891 094 A2 



*":'»—"> 



- "r 



• • * ■ 

■ • * >f * 



5 :~-\ 



•' I 

r k • 



V • • * 

• t' -1 



■ \ 

-/'. 



.».-.* 



.*.**' %\ 
:*.• 

4 4 ■ 

' ■ ".■>■" 



* * * • 

. » • 




CD 




263 



EP 0 891 094 A2 

* 



* 







. m ^ • ■ - 
.- • 

• " ■ '. ' . • • 


■>.: 

* . . . - " 


.•*.•, . • ; • ••] 
• •/ .*.» 'j . - 9] 






■ • V V • ■ 
. • • m • 

* • " >■ 


* ♦ • " ' « . 


: « 

• .*"■ » *■*•*" •* I 










••••;--.-v- ; :^?.. 



FIG. 77 



264 



r 



EP 0 891 094 A2 





265 



EP 0 891 094 A2 



FIG. 79 



266 



EP 0 891 094 A2 




FIG. 80 



267 . 



EP 0 891 094 A2 




FIG. 81 



268 



r 



EP 0 891 094 A2 




269 



EP 0 891 094 A2 




FIG. 83 



270 



FIG. 84 



271 



BP 0 891 094 A2 




i ■ 



• 



FIG. 85 



272 



V 



EP 0 891 094 A2 





FIG. 86 



273 



EP 0 891 094 A2 




274 



EP 0 891 094 A2 




INPUT PELS \9 

OUTPUT PEL POSITIONS 



PHASE 0 



256 



PHASE 1 


1 




6/ 


\220 


42 












PHASE 2 
PHASE 3 




) 




i128 


128 










42 


220 


-6 








PHASE 0 0 


256 


0 



FIG. 88 



275 . 



EP 0 891 094 A2 




FIG. 89 




FIG. 90 

276 



EP 0 891 094 A2 





FIG. 91 



EXTENT OF PICTURE(HORIZONTAL_SBE) 




SHOULD HAVE REPEATED 
THIS PEL 



DUBIOUS 
DATA 



REPEATED 
PEL 



FIG. 92 



Pa DATA 



LAST PEL IN FIELD 



LAST_IN_UNE 



FIELDJD 



FIRST PEL OF NEXT FIELD 





J 



FIG. 93 



277 



EP 0 891 094 A2 




278 



EP 0 891 094 A2 



V SD SB 



V SD SB 




S0=SELECT DATA 

SB=SELECT BOR0ER(DASHED LINE IS SB') 

F= DIGITAL FIELD INDICATOR 

V= DIGITAL VERTICAL BLANKING(DASHED LINE IS V) 



c 



FIELD SYNC 
] EQUALIZATION SYNC 



ANALOGUE BLANKING 
INACTIVE 



FIG. 95 



279 



EP 0 891 094 A2 




FIG. 96 



280 



EP 0 891 094 A2 



HALF-LINE COUNT 
(TO V MACHINE) 



H CONTROL 



HBORDER PELS 



10 OTS(360u) 



COUNTER(UP) 



HARDWIRED C0MPARAT0RS(x12) 



BORDER REGISTER 



SUBTRACTOR(PICTURE WIDTH) 



MUX2 



COUNTER(DOWN) 



HARDWRED ZERO DETECT 



FIG. 97 



281 



EP 0 891 094 A2 




282 



I 



EP0 891 094 A2 




283 



■ 



EP 0 891 094 A2 



CODED 
CLOCK 



HANDSHAKE 



SYNCHRONOUS 
DRAM 



r 



8 



CODEO / 
DATA ' 



r— LxJ 



co 

CO 



a: 

C5 



16 



SDRAM 
INTERFACE 



MICROPROCESSOR 
INTERFACE 



o LU 



to 

UJ 
CD 
O 

i 



8 



Cr/Cb 



SYNCS, BLANKING 



NTSC/PAL 



27MHz 



FIG. 100 




FIG. 101 



284 



i 



EP 0 891 094 A2 




CD i o i o i o i o i o n 



d d Id 



d d 



FIG. I03 



1 xx 



0 xx 




0 


0 


1 





1 0 17 



FIG. I04 



286 



EP 0 891 094 A2 



CDC10CK 



K. 



r 



CDACCEPT 



COVALIO 
C0[7:0] 
COEXTN 

8M0DE 




FIG. I05 



287 



EP 0 891 094 A2 




288 



EPO 891 094 A2 




289 




290 



EP 0 891 



094 A2 




291 



EP 0 891 094 A2 




292 



EP 0 891 094 A2 




293 



EP 0 891 094 A2 



FIG. 1 1 2a 



4 

5 

6 

7 

8 

9 

10 

11 

12 
13 
14 

15 

16 

17 
18 

19 
20 

21 

22 

23 
24 



FIELD 1 

mrn^ HELD SYNC. 




E 



BLANKING 



EMPTY ACTIVE LINES 



I--.- ^ -.. ........ 



::.,r-.j 



261 

262 

263 
264 

265 



266 CZ 



9 ^. — -t : .-. -.- -» « «. •>■» •. .«».r . • : -*-• 



All 



. : •■ ■• "■ -V- .•: « " ■■{ 




F V 
0 1 

0 1 

0 1 
0 1 

0 1 

0 1 
0 1 
0 1 
0 1 
0 1 
0 1 
0 1 
0 1 

0 1 
0 1 

0 1 
0 0 
0 0 
0 0 
0 0 
0 0 



0 0 
0 0 
0 1 

0 1 

1 1 

0 0 



orCBLANK IS ACTIVE BUTY(7:0] STILL OUTPUT BLANKING 



294 



EP 0 891 094 A2 



266 

267 
268 



FIELD 2 

MIM RELD SYNC - 



k-yv.:-^-;::-- V,'A'.^'.-r-.-y.^,l 



272 C 



E 



273 C 

274 E 
275 
276 
277 

278 

279 



£ 



E 



[ 



[ 



280 [ 

281 

282 [ 

283 • 
284 



FIG. 1 1 2b 




r 



c 



BLANKING 



] 



EMPTY ACTIVE LINES 



1 



3 



3 



> 













2 

k 1 





F V 
1 1 

1 1 
1 1 
1 1 

1 1 

1 1 

1 1 

1 1 

1 1 

1 1 

1 1 

1 1 

1 1 

1 1 
1 1 

1 1 
1 0 

1 0 

1 0 

1 0 

1 0 



523 

524 

525 
1 





' 476 




\^^\rD^yi^^^ 478 






EQUALIZATION 




1 0 
1 0 
1 0 

1 1 
1 1 

1 1 



o:CBLANK IS ACTIVE BUTY[7:0] STILL OUTPUT BLANKING 



295 



EP 0 891 094 A2 



FIELD 1 



FIG. I 1 3a 



t • .■.."' t ,i'"i- 



» 3— r 



1 

2 

3 
4 

5 

6 

7 

8 
9 

10 
11 
12 

13 
14 

15 

16 

17 

IB 

19 
20 

21 

22 

23 
24 



25 r~~i%m&m&£zk&x;>M 



FIELD SYNC 



EOUAUZAT10N 



BLANKING 



E 



FV 



^ 0 1 




3 0 1 



J 0 1 



j 0 1 



] 0 1 



3 0 1 



3 0 1 



J 0 1 



] 0 1 



] 0 1 



J 0 1 



3 0 1 
3 0 1 



3 0 1 



] 0 1 



] 0 1 



3 0 1 
3 0 1 



D 0 1 




MMMSMMM 0 0 

0 0 

0 0 
0 0 
0 0 

309 mMMMMMMM 572 y jMMM MMgM 0 0 

310 MSMMiMM ESMii JZME&ggSZgiBB 0 0 



311 r s s s s s s ss^_j o i 

312 r S EQUALIZATION [ V 2 

313 I I I l M I l\ 

o: CBLANK IS INACTIVE BUT Y[7:0] STILL OUTPUT BLANKING 



0 1 

1 1 



296 



EP 0 891 094 A2 



FIG. 1 1 3b 



FIELD 2 



313 





FIELD SYNC. 

L J 






isgl&Mitlffl 






I/.*, w 1 .' .•?■■«: » if r-o v": f. ■! .'• ■•>•' •»•! 





317 
318 
319 







EQUALIZATION 












r— ^ j y a i : ^ i 



320 EH 



3 



321 E 



322 E 



— ~~t 



323 E 



324 E 



325 E 



326 L 



327 E 



328 E 











BLANKING 









3 



D 



329 E 



330 E 



331 E 



332 E 



333 E 



334 E 



335 E 



] 



fr^-r-N agHlS r "..-:v .■-..v. A 
W.-.'. , .:«--y..>W.-. , «l!-. B J.l.-J I 



571 



3 



573 ^•■^^^'v^^^l 




0 
0 



1 0 
1 0 
1 0 

1 1 

1 1 



a: CBLANK IS INACTIVE BUT Y(7:0] STILL OUTPUT BLANKING 



297 



EP 0 891 094 A2 




298 



EP 0 891 094 A2 




299 



EP 0 891 094 A2 



n/c 



CLK 
CKE 
WE 
CS 
RAS 

CAS 
A11 

A10 

A9 



0Q0 TO DQ7 
LOOM 
UDQM 



1Mx16 
SDRAM 



FIG. 1 1 6 







300 



EP0 891 094 A2 



D0[7:0] 
DD[15:8] 
DA[8:0] 

DA(9] 
DA{10] 

DBS 
DCAS 

URA5 



DCS 
DCS 



:u 

DWE 
DCKE 

0CLK0UT 
0 CLKIN 



2 



to 



co 5^ 

lT> 



T 



UJ 













3 s 





301 



EP 0 891 094 A2 



DD[7:0] 
0D[15:8] 
DA[8:0] 

DA[9 
DA[10 

DBS 
OCAS 

DCS 
DCS 



1 

0 



OWE 
DCKE 

DCLKOUT 
0 CLKIN 



T 



00 



3 



<J> t-J 



O Or 
■ I 



5553 g g 3 




3 



to 



to 



S3 



302 



EP 0 891 094 A2 



DD[7:0] 
D0[1 5:8] 
0A[8:0] 

DA[9' 
DA[10 

DBS 
OCAS 

_DT8S 
DCS[lJ 
DCsjoj 

OWE 
DCKE 

DCLKOUT 
0 CLKIN 



38 



Silas 



7 



< 



3£ S3 



X 

Ol 



CO 



CM 



CO 



o 

Ll. 



303 



EP 0 891 094 A2 



SCO DETECTS 
END OF PICTURE 




N 



GENERATE 
FLUSH TOKEN 




N 



GENERATE 
FTAG_ PICTURE. END_ EVENT 
INTERRUPT 




N 



N IS 

AFTER_PICTURE_DISCARD 



=1? 



SET 

DISCARO_ALL 
TO 1 



GET NEXT 
INPUT TOKEN 




CONTINUE 
PROCESSING 



FIG. 120 



304 



EP 0 891 094 A2 



SET NON-ZERO 
START_CODE_ SEARCH 




GET NEXT 
INPUT TOKEN 



SET DICARD_ALL TO 
0. (DISCARD FLUSH 
TOKEN) 



GET NEXT INPUT 
TOKEN OR START CODE 



N 





N 



N 



DOES START 
CODE SATISFY SELECTED 
SEARCH? 



SET 

START_CODE_SEARCH 
TO 0 




GENERATE 
END_ SEARCH. EVENT 
INTERRUPT 




1 



CONTINUE 
PRXESSING 



FIG. I2I 



305 



EP 0 891 094 A2 



MPEG 
SYSTEMS 
STREAM 



27MHz 




TIME STAMP 
INFORMATION VIA 
CODED DATA OR 
MICROPROCESSOR 
INTERFACE 



FIG. I22 



CODED 


MICRO- 




DATA 


PROCESSOR 


CO 
UJ 


INTERFACE 


INTERFACE 


UJ 









306 



I 



EP 0 891 094 A2 




307 



EP 0 891 094 A2 




308 



CO 

< 

<T> 



(19) 





(12) 



(88) Date of publication A3: 

12.01.2000 Bulletin 2000/02 

(43) Date of publication A2: 

13.01.1999 Bulletin 1999/02 

(21) Application number . 98202243.6 

■ i 

(22) Date of filing 28.07.1995 



Europaisches Patentamt 
European Patent Office 

Office europ6en des brevets (11) 

EUROPEAN PATENT APPLICATION 

(51) bit a 7 : H04N7/26 





EP 0 891 094 A3 



(84) Designated Conti acting States: 


(72) 


Inventors: 


AT BE CH DE ES PR GB IE IT LI NL PT SE 


• 


Smith, Colin 






Bristol, BS18 4XF (GB) 


(30) Priority: 29.07.1994 GB 9415413 


• 


Finch, Helen Rosemary 


07.06.1995 GB 9511569 




Gloucestershire GL12 7ND (GB) 


(62) Document number(s) of the earlier application(s) in 


• 


Claydon, Anthony Peter John 




Bath, Avon BA2 6BZ (GB) 


accordance with Art 76 EPC: 






95305308.9 / 0 695 095 


(74) 


Representative: 






Vuillermoz, Bruno et al 


(71) Applicant: 




Cabinet Laurent & Charras 


Discovision Associates 




B.P. 32 


Irvine, CA 92614 (US) 




20, rue Louis Chirpaz 






69131 Ecully C6dex (FR) 



(54) Video decrompresslon 

(57) A parallel Huffman decoder, comprising : 

an input, accepting a mixed data stream comprising 
Huflman coded variable length codes, fixed length 
codes, and data tokens, wherein all complete Huff- 
man codes are received in a single cycle of opera- 
tion ; 

a selector operative on said data stream, wherein in 
a first mode of operation variable length codes are 
provided to said address lines of a Huffman Code 
ROM, and in a second mode of operation data 
tokens and a signal representative of a length of 
fixed length codes is output by the decoder without 
reference to said ROM ; 

a pair of input registers for receiving Huffman coded 
data, both of said registers directing input in parallel 
to said selector ; and 

a Huffman Code ROM for receiving input from said 
selector and another ROM table select input; said 
ROM providing a first decoded data output, said 
selector being responsive to a second output of 
said ROM. 




FIG. I 



O) 
CO 



Q. 

LU 



Primed by Xerox (UK) Business Services 
2 16.7/3 6 



EP 0 091 094 A3 




European Patent 
Office 



PARTIAL EUROPEAN SEARCH REPORT 



Application Number 



which under Rule 45 of the European Patent Convention^ p 98 20 2243 
shall be considered, for the purposes of subsequent 
proceedings, as the European search report 



DOCUMENTS CONSIDERED TO BE RELEVANT 




Category 


Citation of document with indication, where appropriate, 
of relevant Dassaoes 


Relevant 
to claim 


CLASSIFICATION OF THE 
APPLICATION (lnLCI.6) 


X 


KUN-MIN YANG ET AL: "VLSI ARCHITECTURE 

DESIGN OF A VERSATILE VARIABLE LENGTH 

DECODING CHIP FOR REAL-TIME VIDEO CODECS" 
ppnrrrnTMfiQ nr tuf dt/^thm in rfiwrrprwrr hm 

r KULt t UllMbo Ur t ML. KtulUN ill UUnrtKtNut UN 

COMPUTER AND COMMUNICATI SYSTEMS (TENCON), 

HONG KONG, 24 - 27 SEPT., 1990, 

vol. 2, 24 September 1990 (1990-09-24), 

pages 551-554, XP000235934 

INSTITUTE OF ELECTRICAL AND ELECTRONICS 

CI. ull.uLIVO 

* the whole document * 


1-24 


H04N7/?6 


A 


EP 0 562 419 A (GEN INSTRUMENT CORP) 
29 September 1993 (1993-09-29) 
* abstract; figures 1-9 * 


1-24 




A 


EP 0 503 956 A (C CUBE MICROSYSTEMS) 
16 September 1992 (1992-09-16) 
* figure 10A * 


1-24 




A 


EP 0 506 294 A (AMERICAN TELEPHONE & 
TELEGRAPH) 30 September 1992 (1992-09-30) 


1,20 


TECHNICAL FIELDS 
SEARCHED (lntXI.6) 




* abstract; figures 5,8 * 




H04N 




-/— 






INCOMPLETE SEARCH 




The Search Division considers thai the present application, or one or more of its claims, does/do 
not comply with the EPC lo such an extent that a meaningful search frito the state of the art cannot 
be carried out. or can only be carried out partially, tor these claims. 




Claims searched completely : 






Claims searched incompletely : 






Claims not searched : 






Reason (or the Bmtialion ot the search: 






see 


sheet C 







Place of search 



THE HAGUE 



Date of completion of the search 

11 November 1999 



Exarrwnef 

Giannottl, P 



CATEGORY OF CITED DOCUMENTS 



X : particularly relevant H taken atone 

Y : particularly relevant il combined with another 

document of the same category 
A : technological background 
O : n on- written disclosure 
P : intermediate document 



T : theory or principle underlying the invention 
E : earlier patent document, but published on. or 

after the fifing date 
D : document cited in the eppi cation 
L : document cited tor other reasons 

6 : member of the same patent tamoy . corresponding 
document 



EP 0 891 094 A3 




European Patent 
Office 



PARTIAL EUROPEAN SEARCH REPORT 



Application Number 

EP 98 20 2243 



I DOCUMENTS CONSIDERED TO BE RELEVANT 




CLASSIFICATION OF THE 
APPLICATION (IntCI.S) 


Category 


Citation of document with indication, where appropriate, 
of relevant passages 


Relevant 
to claim 





US 5 325 092 A (ALLEN JAMES ET AL) |l,20 
28 June 1994 (1994-06-28) ...... 

US 5 280 349 A (WANG JINN-SHY AN ET AL) |l,20 
18 January 1994 (1994-01-18) 

* abstract * 

* column 6, line 62 - column 7, line 11 * 

US 5 253 053 A (CHU KE-CHIANG ET AL) |l,20 
12 October 1993 (1993-10-12) 

* abstract; figure 13 * 

US 5 173 695 A (SUN MING-TING ET AL) |l,20 
22 December 1992 (1992-12-22) 

* the whole document * 



SHIH-FU CHANG ET AL: "DESIGNING |l,20 
HIGH-THROUGHPUT VLC DECODER PART 
I-CONCURRENT VLSI ARCHITECTURES" 
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS 
FOR VIDEO TECHNOLOGY, 

vol. 2, no. 2, 1 June 1992 (1992-06-01), 
pages 187-196, XP000280658 
* figures 2,7-9 * 

EP 0 576 749 A (PIONEER DIGITAL DESIGN 
CENTRE) 5 January 1994 (1994-01-05) 



TECHNICAL FIELDS 
SEARCHED (lnt.Ct.6) 



EP 0 891 094 A3 




European Patent 
Oft ice 



INCOMPLETE SEARCH 
SHEET C 



EP 98 20 2243 



Application Number 



Claim(s) searched incompletely: 



Reason for the limitation of the search: 
Present claims 1-24 relate to Huffman decoding. 

In respect of the claims of the present application, it is not evident 
where support within the meaning of Article 84 EPC and dislosure within 
the meaning of Article 83 EPC are to be found, because of the very high 
number of pages of description and drawings whose relation to the claims 
1s unclear. 

In particular, the application does not disclose the claimed invention in 
a manner which is sufficiently clear and the claims do not define the 
matter for which protection is sought in clear terms. Consequently, the 
application does not comply with the requirements of Articles 83 and 84 
EPC to such an extent that it is not possible to carry out a meaningful 
search into the state of the art on the basis of the claims as they 
presently stand. 

A search has been carried out for the claims only to the extent to which 
they can be presently interpreted, namely in respect, of the 
subject-matter mentioned in the published appl icati on at pages 48-50, 
with reference to figure 52. 



4 



EP 0 891 094 A3 



ANNEX TO THE EUROPEAN SEARCH REPORT 

ON EUROPEAN PATENT APPLICATION NO. EP 98 20 2243 



This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report. 
The members are as contained in the European Patent Office EDP file on . . . ■ 

The European Patent Office is in no way liable tor these particulars which are merely given for the purpose of information. 

11-11-1999 



Patent document 




Publication 




Patent lamily 




Publication 


cited in search report 




date 




, member(s) 




date 


EP 0562419 


A 


29-09-1993 


US 


5233348 


A 


03-08-1993 






AU 


3403393 


A 


30-09-1993 








CA 


2091815 


A 


27-09-1993 








JP 


2511788 


B 

*-* 


03-07-1996 








JP 


6104768 


A 


15-04-1994 


EP 0503956 


A 


16-09-1992 


CA 


2062200 


A 


16-09-1992 






DE 


503956 


T 


28-07-1994 








JP 


7131785 


A 


19-05-1995 








US 


5379356 


A 


03-01-1995 








us 


5608888 


A 


04-03-1997 








us 


5649029 


A 


15-07-1997 








us 


5768292 


A 


16-06-1998 








us 


5890124 


A 


30-03-1999 








us 


5864817 


A 


26-01-1999 








us 


5805488 


A 


08-09-1998 


■ 




■ 


us 


5870497 


A 


09-02-1999 


EP 0506294 


A 


30-09-1992 


us 


5220325 


A 


i c t\c i nni 






JP 


2091442 


C 


18-09-1996 








JP 


6252775 


A 


09-09-1994 








JP 


7123229 


B 


25-12-1995 


US 5325092 


A 


28-06-1994 


DE 


4314741 


A 


10-02-1994 






FR 


2693607 


A 


14-01-1994 








GB 


2269070 A, B 


26-01-1994 








JP 


6085689 


A 


25-03-1994 


US 5280349 


A 


i8-oi-iyy4 


DE 


4217008 


A 








JP 


2831888 


B 


02-12-1998 








JP 


7123407 


A 


12-05-1995 


US 5253053 


A 


12-10-1993 


JP 


6217277 


A 


05-08-1994 


US 5173695 


A 


22-12-1992 


NONE 






EP 0576749 


A 


05-01-1994 


CA 


2099172 


A 


31-12-1993 






DE 


69229338 


D 


08-07-1999 








JP 


2834388 


B 


09-12-1998 








JP 


6348492 


A 


22-12-1994 








US 


5768561 


A 


16-06-1998 








US 


5784631 


A 


21-07-1998 








US 


5603012 


A 


11-02-1997 








US 


5907692 


A 


25-05-1999 


« 






US 


5956519 


A 


21-09-1999 


> 

2 






US 


5881301 


A 


09-03-1999 



Si For more details about this annex : see Official Journal of the European Patent Office, No. 12/82 



EP 0 891 094 A3 



ANNEX TO THE EUROPEAN SEARCH REPORT 

ON EUROPEAN PATENT APPLICATION NO. EP 98 20 2243 



This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report 
The members are as contained in the European Patent Office E DP file on 

The European Patent Office is in no way liable for these particulars which are merely given for the purpose of information. 

11-11-1999 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



EP 0576749 



US 



5809270 A 



15-09-1998 



For more details about mis annex : see Official Journal of the European Patent Office. No. 1 2/82 



